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Preface 


Please Read This Before Reading the Text! 


This book explains how to perform both simple and complex statistical 
calculations using APL. “‘APL’’ is an acronym for ‘‘A Programming 
Language’’—a computer programming language that is ideal for the com- 
putational work done in statistics. 

The authors are both economists, and the content reflects their profes- 
sional interests. However, political scientists, physicists, sociologists, in- 
dustrial psychologists, public health and dental researchers, and others 
have used this book and found it helpful. 

No previous knowledge of computers, computer programming, or meth- 
ods involved in statistical computation will be needed to understand this 
book. You will start from the most elementary statistics and progress to 
more complicated procedures on a gradual step-by-step basis. The numer- 
ous examples, exercises, and statistical applications are drawn from a 
variety of fields. Emphasis is placed on how to obtain the statistical results 
with ease. Using this book you will be able to perform computations that 
otherwise would be so cumbersome or time-consuming that you would not 
do them. You also will be able to perform experiments and computer 
simulations with relatively little effort. 

The APL statistical procedures presented are useful to researchers, 
analysts, managers, and anyone concerned with statistical calculations. We 
believe that when you have seen how easy it is to perform these computa- 
tions, you will be as pleasantly surprised as we were. If you are familiar 
with computers here is a dramatic example of the simplicity of APL com- 
pared to the FORTRAN statements used to compute the arithmetic mean. 
If you are a novice in these things don’t be frightened—everything will be 
explained. 
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of FORTRAN 
and APL 


Use of & 
in Multiple 
Regression 
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FORTRAN APL 
DIMENSION X (1000) XO 
READ (5,99)N CA VE<(+/X) +X 


99 FORMAT (14) 
READ (5,100) (X(D, I=1,N) 
100 FORMAT (9F8.0) 


SUM = 0.0 
DO 10 J = 1,N 

10 SUM = SUM + XJ) 
AVE = SUM/N 


WRITE (6,20)AVE 
20 F@RMAT (F10.4) 
END 
To estimate the parameters of Y = B, + B, X, + B,X,+-°+: + ByXy tu 
via multiple regression, you could type in APL: 


B<« YHY 


In other computer languages an equivalent program might take SO state- 
ments. 

This book is not just an introduction to APL programming, although 
many people have learned APL from it. Certainly it is not a statistics 
textbook, but readers have commented that they never really understood 
certain statistical concepts until they ‘‘tried real numbers to see how the 
formulas worked.’ This book is a valuable aid to understanding statistics 
because it actually computes results and even displays probability distribu- 
tions graphically. By the time you finish you will know a lot about APL 
programming. And after you spend a few hours at the computer, you will 
find that it is easier to program your own work than it is to learn to use the 
‘“‘canned’’ (FORTRAN) routines available at the computer facility. More 
importantly, you will understand what you are doing and how the results 
are obtained. We have long maintained that the less you are asked to 
accept unquestioningly, the better is your intellectual health and the 
greater will be your interest in statistical subjects. 

This book is not primarily a textbook. It is a book for the person who 
understands basic statistics, who wants a painless way to compute results, 
and yet wants to know what is really going on. We think that teachers of 
basic or applied statistics and especially econometrics will find our ap- 
proach using APL to be an important part of a practical statistics course. 
Students are often assigned ‘‘artificial,’’ ‘‘theoretical,’’ or ‘‘academic’”’ 
problems, situations, and exercises. These assignments are not made be- 
cause the instructor thinks such things are important. Actually, most in- 
structors understand the difficulty of tackling real statistics problems. Con- 
sequently, when the amount of computational pain the student (and 
teacher) must go through to get the statistical result is compared to the 
“‘statistics’’ that can be taught, a stress on pure theory almost always 
results. Thus, after a course (or even several courses), an individual may 
be unprepared to solve the first problem—how to perform the calculations! 
The use of APL minimizes these difficulties. 


Purpose of 
These 
Comments 
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We think that when you complete APL-STAT you will agree— 
programming can be easy! 

Because the text proceeds in a carefully structured sequence it is impor- 
tant that you follow it exactly and that you make sure you thoroughly 
understand each section before moving to the next. Later sections assume 
that prior sections have been mastered. You should do the exercises and 
check your answers in the back of the book. Above all, you can teach 
yourself a lot by experimenting, so try it. 

If you forget something, the primitive function glossary at the back of 
the book will help you recall earlier material. If you need more informa- 
tion, the side of most pages has brief comments. These comments contain 
the name and symbol of the APL operator introduced on that page. You 
will be able to flip through the book quickly and locate what you want, 
using the comments. They also provide a quick visual guide to the major 
topics in any section. 

We have a request. In the back of the book is an error sheet for recording 
our omissions, bad language (though never foul!), and other sins. We would 
be most obliged if you would send us this error sheet with your comments. 
The next edition will then be much better with your help. 


JAMES B. RAMSEY GERALD L. MUSGRAVE 


Note to Instructors 


Instructors can assign much more meaningful examples and exercises 
using the procedures in this book than using either canned programs or 
hand calculation. Students will not be spending time in tedious calculation 
or in using the computer as a black box. Students will be able to perform 
calculations, including complex matrix algebra, know how they are done, 
and see the numerical results. They will be able to obtain results they 
understand. One example is where a multiple regression model requires 
the intercept to be ‘‘forced’’ through zero. It is surprising how simple the 
mathematics of this is (not having a column of ones in the regressor X 
matrix). It is also surprising how few preprogrammed packages allow this 
option. In APL you can modify your program to handle this change in a 
matter of moments. 

Computer simulation and generation of distributions become a relatively 
trivial task in the hands of an APL-proficient student. We could enumerate 
a long list of such examples, and once you start you will see them too. 
Also, we have included our benchmark program data on the Longley re- 
gression problem in Appendix B. You may find it interesting to compare 
the computational accuracy of APL programs with the canned ones on 
your home computer or at your computer center. 

In using this book as a text you might consider the following ideas. The 
titles of certain sections, e.g., The Normal Distribution in Chapter 6, are 
starred. These starred sections involve mathematical material which may 
be beyond the scope of an elementary course in statistics that doesn’t have 
a mathematical prerequisite. Any APL instructions introduced in such 
sections will not be used anywhere else in the text without reexplanation. 
So starred sections can be dropped without fear of losing some important 
information about APL. 

The book is carefully structured in that it follows the usual pattern of 
topics in the introductory statistics course and only uses as much APL as 
is needed to get the job done. Consequently, it is important that, except for 
the starred sections, the sequence be followed and sections are not skipped. 

If you decide to alter the presentation of statistical subjects, have your 
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students read the APL-material in sequence, even if they skip the earlier 
presentations of the statistics. A number of readers have used this ap- 
proach and found it to be satisfactory. In these cases the readers either 
knew statistics or were not interested in statistics per se. They wanted to 
learn APL and found this approach to be effective. One reason for this is 
that APL instructions are introduced to solve specific problems rather than 
presented in the abstract. 

Each chapter has a large number of exercises and applications. The 
exercises help in exploring the use of APL concepts, functions, and sym- 
bols. The statistical applications help extend the depth and breadth of APL 
use. Throughout the book, experimentation is encouraged to expand and 
intensify interest and understanding. 

An elementary nonmathematical course in statistics would usually stop 
at Chapter 9, which covers contingency tables, analysis of variance, and 
simple linear regression with one regressor. Chapters 10, 11, and 12 intro- 
duce various aspects of matrices and prepare the way for multiple linear 
regression analysis and topics that might be regarded as more ‘‘economet- 
ric.’’ You may find that the use of APL will allow you to cover Chapters 10 
through 13 as well. This is important since the rudiments of matrix algebra 
can be taught quickly using APL. The benefit will be that you can enable 
your students to master multiple linear regression and more complicated 
analysis of variance techniques more easily. 

Three administrative matters might be of interest. Many computer cen- 
ters have only a few APL terminals. Don’t let this apparent difficulty slow 
you down. First, if the terminals use a typing ball or a daisy wheel, the 
center can obtain APL balls or print wheels. They are easy to switch, are 
low in cost, and small adhesive labels are available for the keys. Second, if 
the terminals use a non-APL matrix printer or if the terminals are CRT’s 
without APL characters, another solution is available. A Mnemonic 
character set that substitutes for the APL symbols is available. The multi- 
ple regression example in the preface was coded as 


Y<YRX 


using the standard APL character set. In the Mnemonic character set it 
would be written as 


Y<«Y .DQX 


Appendix C contains both the standard and Mnemonic character sets. 
Third, some computers have implemented only the monadic version of 
domino. In this case you simply enter the following two lines 


VYDQX 
(EC (QX) +. xX) +. x ((QX)4.xY)V 


when you enter YDQX the result is the same as if YHX¥ had been entered. 

If in using the book you have any comments that would be helpful to 
others please pass them along to us and we will incorporate them in the 
next edition. 


Introduction 


1.1 Overview of APL 


APL is a powerful and versatile computer programming language. When 
you use this language to communicate with the computer it will be as if you 
were personally operating the machine. APL is designed to operate on 
small microcomputers no larger than a typewriter, on minicomputers the 
size of one or two office desks, and on large maxicomputers the size of a 
truck. No matter how large or small the computer, once you log-on to the 
system it will appear from your perspective that you have a one-to-one 
relation with the computer. The APL contained in this book has been used 
on micro-, mini-, and maxicomputers produced by a variety of manufac- 
turers. We found the APL language to be remarkably similar in all of these 
cases, 


Administrative Procedures 


The procedures used to log-on to the various systems that we have used 
vary greatly. Each computer center has its own administrative procedures, 
keywords, passwords, and account verification methods. In addition, you 
usually need to connect your computer terminal to the computer itself and 
this process can be mysterious at first. There is really nothing to this at all. 
Nevertheless, sometimes people who hang around computer centers make 
a big deal about the administrative and technical matters surrounding the 
use of the machine. The truth of the matter is that the procedure is much 
the same as getting a key for an office, registering for a class, or signing up 
for Little League. It’s a hassle. Every organization thinks that there is only 
one way to do it, and yet every way is different. Appendix A contains a 
brief description of how it is done at the Stanford and NYU computer 
centers, and on an IBM 5120 desk-top computer. This description should 


CLEAR WS 


)OFF 


State Diagram 
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allow you to understand better the procedures that are used with your 
computer. In a short time the mystifying intricacies of gaining access to the 
computer become second nature. You type a few words and numbers and 
you are ready to go. 


The APL Keyboard 


We have included a few diagrams of typical APL keyboards in Appendix 
A. The alphabetic characters are in exactly the same position as they are 
on a standard typewriter. These letters are all capitals but (wouldn’t you 
know it) they are in the lowercase positions. Holding the shift key down 
while pressing a specific key enters a special APL symbol. Each of these 
symbols performs a specific operation in APL. As you can see, the 
keyboards are almost identical, and in the very few instances where some 
minor differences do exist we will explain them. One of the most frighten- 
ing things that the new APL programmer encounters is the APL character 
set. All those strange symbols are indeed foreboding. However, our ex- 
perience has been that the symbols are easy to learn. They are not much 
more difficult to learn than the international road signs, especially if you 
take them one at a time in the context of an actual problem. 


Some General Features of APL 


Now suppose that you are sitting in front of the keyboard and you have 
logged-on. The computer has responded with the message CLEAR Ws. The 
computer is indicating that you have been allocated a part of the 
computer—APL calls it a Work Space—named CLEAR. Now you can 
communicate with the computer, and it is in fact much like an electronic 
hand-held calculator except that it is much more powerful. To turn off the 
computer you simply type )OFF, for example, and log-off. You will soon 
see how APL can be used as a very powerful calculator in the immediate 
execution mode. However, it can do much more. 

You can define a set of instructions that will perform tasks such as 
balancing a checkbook; computing means, standard deviations, and re- 
gression coefficients; or directing the computer to simulate a Las Vegas 
casino game. In APL, the set of instructions is called a defined function. 
After the function has been defined you simply refer to it by name. The 
same instructions, operating on different data, can be used over and over 
again. 

Figure 1.1 is a state diagram that represents these three APL modes. 
When you log-on you are given a clear work space, you are in immediate 
execution mode, and you have a powerful calculator at your disposal. You 
can enter data, process the data with a one-line APL expression, define an 
entire new work space with different functions and data, and test your 
functions on a line-by-line basis before you program the whole set of 
instructions. 


Figure 1.1 
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Then you can define your own function, edit any part of it, or modify it 
for a particular application. Also, should a function stop because of a 
programming error and further processing thereby be suspended, you can 
correct the error by editing the function and then resume the function’s 
execution from the point of suspension. You need not start from the begin- 
ning if your previous calculations were correct. 

A function is executed by simply entering its name. You can specify the 
particular data set to be processed, and your function can call other func- 
tions, request data, and produce results for use by other functions. In 
addition, you can trace the execution of your function by having the results 
of any line or group of lines displayed—all of this without having to write 
any output statements. When your function’s execution is completed it 
returns you to immediate execution mode where you began. We hope that 
this sounds simple, straightforward, and like something you can do— 
because it is! 


1.2. Road Map of Where We Are Going and How We Will Get There 


In the next chapter you will learn how to use APL as a calculator. After 
these basics are under your belt, the general presentation is to explain a 
statistical problem and then to solve it using APL. On the way to the 
solution the various APL functions and programming methods are pre- 
sented and explained. We first discuss the sample mean and median, stan- 
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dard deviation, covariance, and higher order moments. Then we investi- 
gate a number of the most prominent statistical distributions, including the 
binomial, Poisson, and normal density and cumulative distribution func- 
tions. After you learn how to handle more complex data structures in APL 
and to write more general and powerful functions you will learn how to 
diagnose and correct programming errors. After you go through a case 
study using APL in a research project, we present an introduction to 
elementary linear correlation and regression, analysis of variance, and the 
chi-square and F distributions. Next we show how to do matrix algebra in 
APL, including the operation of matrix inversion, which is performed with 
one symbol, &. Multidimensional arrays are discussed in Chapter 12, 
where the various APL functions are explained in relation to these higher 
order arrays. The final chapters concentrate on computational statistics 
related to multiple linear regression, two-stage least squares, instrumental 
variables, Aitken estimators, Durbin’s First Order Autoregressive Models, 
and K-class estimators including limited information maximum likelihood 
estimators. 

Don’t let this impressive sounding jargon put you off. The first half of the 
book has been understood by good high-school students, and they were 
able to write APL programs after only a few hours of study. The later 
chapters have been used in both undergraduate and graduate classes. Also, 
the statistical routines have been used by a number of our colleagues in 
their statistical research. So you can see that while much of the material is 
technical, it progresses at a measured rate. Figure 1.2 is a schematic rep- 
resentation of APL-STAT. It might help you to visualize how the various 
components of APL are related. 

We can summarize our position this way: 


APL 
TRY IT—YOU’LL LIKE IT 


So turn the page and let’s go. . . 


Getting Started 


2.1 Some Keying Conventions 


Now that you are seated comfortably in front of your terminal or minicom- 
puter, everything is switched on, and the terminal is set to receive your 
instructions in APL, we can begin. Our first task is for you to gain some 
familiarity with the use of your keyboard as shown in figure 2.1. 

Sometimes we want to indicate to you very clearly that there is a blank 
space. For example, consider the character string ABCD EFG, which you 
type by hitting the A, B, C, D keys, the space bar, and then the keys E, F, 
and G. Blank spaces will be indicated, but only when we need to stress that 
there is a blank, by printing an ampersand (&) in a subscript position. In 
the above example, we would print ABCD, EFG. You can read this as: A, 
B, C, D, and E, F, G. You will not find the & character on your APL 
keyboard; we use it in the earlier chapters to emphasize blanks until your 
eye is accustomed to the idea. 


2.2 Simple Arithmetic 


Arithmetic 
Functions 
+,-,X%,F 


We will start by making sure we know how to add (+), subtract (—), 
multiply (x), and divide (+) numbers on the computer. The symbols +, -, x, 
and : are the symbols for the mathematical operations of adding, subtract- 
ing, multiplying, and dividing, respectively. On the IBM 5120, for exam- 
ple, they are found on the far right-hand side of the keyboard, next to the 
keys with the integers from 0 to 9. You may also use the numbers shown on 
the top row of the main keyboard and the arithmetic functions shown at the 
end of that row (this is the most common configuration). 

You instruct the computer to perform a calculation by hitting the RE- 
TURN or EXECUTE key; the instruction is executed only after you hit 
the key. 
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Figure 2.1 IBM 5120 ; < - 
desktop computer ‘ “¢ as 
showing the APL 
character set, numeric 
pad, and special 
function keys. 
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This photograph showing a 5120 desktop computer can be programmed 

in either Basic or APL with the flip of a switch. The keyboard is exactly 
like a standard typewriter in that pressing the shift key (either of the keys 
with the wide arrows on the bottom rank of keys) results in the APL charac- 
ters being entered into the computer. A convenient feature is that by 
holding the command key (CMD, on the far left) and pressing one of the 
keys on the top row will produce an entire command. For example, holding 
down CMD and pressing | results in the command )LOAD being entered 
automatically. 


IBM 5120 showing 
keyboard characters 
that can be entered 
using the command key. 
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This photograph shows the special overstruck characters that can be 
produced with one stroke. The command key is held down and any of the 
individual keys now represents a new symbol or combination of key strokes. 
For example, pressing the CMD key and the F key results in the divide quad 
or domino function being entered. If the machine were in the Basic 
programming mode the characters INPUT would have been entered. Using 
the CMD key saves a number of key strokes and is a handy feature. 


Addition 
Add + 1+2 


If you did not get the answer after keying in the digit 2, hit the RETURN 
or EXECUTE key. Now try addition with decimals: 


1.2+0.6 


1.8 


SYNTAX ERROR 


Minus 


Negative Numbers 
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But if you key in 
1.,2t0.,6 
1 8 


This does not look right! What happened? 

Clearly, embedded blanks in real numbers (numbers that include a dec- 
imal) cause problems! So do not embed blanks in real numbers. You will 
understand why you got 1,8 and not 1.8 by the end of this chapter. 

Try 


1+ 
SYNTAX ERROR 

1+ 

A 


You have made an error and the symbol ~ (a caret) marks the point at 
which the error occurred. Unfortunately, we all get much too familiar with 
this symbol! The error was called a syntax error because the statement 
‘‘execute 1+’? is ungrammatical; it does not make sense to tell the com- 
puter to ‘‘add 1.’’ The computer’s response is to say, ‘Add 1 to what?”’ or 
‘*How can I do this?”’ 


Subtraction 


Key in 3, minus sign (-, which is next to the plus (+) sign), and 2, then 
EXECUTE: 


3-2 


or 


Notice that on the last response negative 1 was printed by the computer as 
~1. The superscript negative indicates the negative sign of the number and 
must be carefully distinguished from the — in —1. In the latter case 
the symbol - represents the operation of subtraction. How do we know the 
difference? By position. For example, 

“2 represents the number ‘‘negative 2’’; 

~2 represents the operation of subtracting 2 from whatever is to the left 
of -. 

How do you type the number ‘‘negative 2’’? This is done by typing the 
symbol ~, which is upper-shift 2 on the keyboard. Try it. 


2 


2 
(Remember to hit EXECUTE!) 


Multiply x 


Divide + 


DOMAIN ERROR 
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Now use the minus operator symbol key: 


-2 


2 


What happened here? 3+ gave a syntax error, why didn’t -2? The an- 
swer is that APL interprets the operation -2, when nothing is on the left, as 
the instruction ‘‘Make the argument (i.e., whatever is on the right) the 
negative of whatever it is.’’ Try - 2 and - -2. You get 2 in both cases. Try 
the following as well: 


+2 


So if the + or - functions are on the left of a number, the sign of the 
number is unchanged by + and reversed by -. But if the number is on the 
left of the function, you will get a syntax error. 


Multiplication 
2x3 
6 
3.52 
7 
Division 
Try 
643 
2 
542 
2.5 
Ory 
0 
4=0 


DOMAIN ERROR 
450 


A 


10 
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Arrays 


Blanks 
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We have hit another error. A useful mathematical convention is that divi- 
sion by zero is an undefined operation, and that is what the computer is 
telling you. In this case the syntax or ‘‘grammar’’ of the use of the function 
+ is correct, but the operation cannot be performed with the number 0; 0 
lies outside the domain of validity for the operation of division. But what 
about 0+0? Try it. 


0+0 
1 


Without going into details at this stage, merely note that this is another 
useful convention—in short, an agreement as to what to do with such an 
operation. 


We will now introduce you to the single most important aspect of the APL 
language, the array. An array is an ordered arrangement of numbers or 
characters. A simple example is a linear arrangement of numbers, such as 
1 3 2 4 5, or characters, such as AN, ARRAY. In one form or another, 
arrays play a vital role throughout this book. Try the following: 


3+263 
266 
and 

34+2,6,3 
596 


What has happened here? In the second example 2 6 3 is treated as a list 
or array of three numbers, viz., 2, 6, 3 inthat order. So an array of num- 
bers is created by separating each number in the array by a blank. Another 
way to do it is to key in 2,6,3 where the comma, instead of blanks, sepa- 
rates the individual digits. If you recall the comment made above about 
blanks inside real numbers, you will see that it is dangerous not only for 
you to embed blanks in real numbers in an array, but also for you to embed 
blanks within integers as well. 
Try the following: 


3+1.2,2.3,3.3,4.1 
4.25.3 6.3 7.1 

3+1.42,2.3,3,.3 
45 5.3 6 3.3 


If these results seem strange (or if you do not get either result) carefully 
check your keying of numbers and blanks. In the second example, the 
result shown occurs because 3 is added to 1., 2, 2.3, 3, and 0.3 in turn; 


APL and Arrays 


APL 
Functions 
and Arrays 
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the blanks denote from the right where one number ends and the next 
begins. Also try 


34+1.2,2.3,3.3,4.1 
4.2 5.3 6.3 7.14 
or 

341.,2,2.3,3,.3 
45 5.3 6 3.3 

1,2,3+3 
45 6 


Notice here that just as you can add a number to a list of numbers you 
can also add a list of numbers to a single number. 


St1, gag goOt 
4 5 367 


So much for the blanks. Now let us get back to the main issue: What 
is meant by adding 3 to an array of numbers? Quite simply, and as you 
would expect, 3 is added in turn to each of the numbers in the array. Now 
try 


4x2 3,4 
8 12 16 
452,4,8 
210.5 
2 it, 84 
0.5 12 
3424172 
101 
The general rule we see is that for any function f such as +, -, x, or +,a 
numbern, and an array a,,d2,.. . , a», the statement ‘‘n f array’’ produces 


an array nfa,, nfa,,...,nfa,, and the statement ‘‘array fn’’ produces an 


array a,fn, ajfn, ..., Gp)fn. 
If one array lets you do a series of operations all at once, what will 


happen if you use two arrays? Try 
1,2 ,3+1,2,3 
24 6 


Clearly, each element of the first array is added to the corresponding ele- 
ment of the second array. Similar results hold for the other arithmetic 
operations. 


12 
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But what if the two arrays have different numbers of elements in them 
(i.e., what if the arrays have different lengths)? There will be some ele- 
ments in one array to which there are no corresponding elements in the 
other. So if we try to operate on arrays of different lengths, we get a 
LENGTH ERROR. Try 


1,2+1,2,3 
LENGTH ERROR 
De DB 2D 8 
A 


But the following is fine: 


1 a2 got il &e as 
24 8 
Summary 

+,-, x, and + are arithmetic functions. 
**349°? adds the numbers 3 and 2; ‘‘3-2’’ subtracts 2 from 3; ‘'3+2” 
divides 3 by 2:‘°'3 x 2’ multiplies 3 times 2. 
‘*+ number’ returns the number. 
‘*- number’’ changes the sign of the number. 

Anarray of numbers is formed by entering numbers in a list separated by 
blanks or by commas. We represent blanks where necessary in this text by 
gq More complex arrays will be discussed later. 

Numbers, arrays, and the arithmetic operations that we discussed in this 
chapter can be combined as follows: 

Number f Number yields a number. 

f Number yields a number. 

Number f yields Syntax Error. 

Number f Array yields an array. 

Array f Number yields an array. 

Array f Array yields an array only if the arrays have the 
same number of elements (the ‘‘same 
length’’). It yields Length Error when the 
arrays have different lengths. 

Ff Array yields an array. 

Array f yields Syntax Error. 

Exercises 


APL Practice 
Let’s explore the use of the functions defined in this chapter: 


1. (a) +2 positive two 
(b) 2+ Syntax Error 
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(c) 142 one divided by two 

(d) -2 minus two 

(e) ~2 negative two 

(f) -"2 minus negative two 

(g) -2 the negative of minus two 

(h) +0 Domain Error 

(i) 340 Domain Error 

(j) 3#(2-2) Domain Error 

(k) 3+° 2 three plus negative two 

(I) 3-+2 subtract positive two from three 
(m) 3x 2 multiply three by negative two 

(n) 3x? 2 three times the reciprocal of negative two 
(0) 3x +2 three times the negative reciprocal of two 


(p) -3x+ 2 the negative of the answer to (n) 
(q) -3% 2 the negative of three divided by negative two 


2. You can get a better idea about the use of arrays by trying the follow- 


ing exercises: 

(a) Target? 

(b) 14 243-2 - 

(c) 1, 243+ 1253 - 
(a)C1 2 53)- ("22 8) 
(ey. 24 ee 128 

(f) Ly 253% 1,2, 3 

(g) 1, 28% 12,3 

(h) (1,°2,3)7( 1,4, 3) 


& & & 


(i) 1 "2 3% 14 73 
) 12, ee 2 
(k) 1,0,3*#1,0,3 


(tag: 24382 


Statistical Applications 


add a number to an array 

subtract a number from an array 

add two arrays 

subtract two arrays 

subtract two arrays 

multiply two arrays element by element 
same as (f) 

divide one array by another, element by 
element 

same as (h) 

Length Error 

Do you get Domain Error? Why or why 
not? 

add the number two to the array 2 3 and 
subtract the sums from one 


1. What is the arithmetic average of 10 and 20? 


2. What is the reciprocal of the arithmetic mean of the reciprocals of 10 
and 20? Verify that this number is smaller than the arithmetic average 


of 10 and 20. 


3. Find the volume of a cube whose sides are 4.5 ft long. 
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4. The following three measurements were taken on one side of a cube: 
0.00000060, 0.00000065, 0.06000063. What is your estimate of the vol- 
ume of the cube? 


5. Seven prices for a popular 35 mm SLR camera were collected from a 
recent photography magazine: $259.95, $245.00, $254.99, $259.99, 
$259.95, and $249.95. 

a. Compute an average price for the camera. 

b. What is the range of prices? 

c. How much more is the highest price than the average price? 

d. What would be the percent saved by purchasing at the lowest price 
compared to the highest price? 


6. A local business selected a representative week’s returned checks 
due to insufficient funds or fraudulent accounts. The checks were 
written for $23.41, $184.24, $73.12, $2.48, $32.00, $14.28, $58.61, 
$84.00, $41.41, $83.27, and $102.87. What would you forecast the 
yearly total amount of returned checks to be? 


Some Elementary 
Statistics 


If you have read the first few chapters in any book on statistics or econo- 
metrics, you will have noted that the sample mean appears quite promi- 
nently. In fact, if you continue using statistics you will be computing a 
large number of means. It will save a lot of time if we can discover a quick 
way to get the computer to do it. Before tackling our first statistic, we have 
to learn an important fact about how a computer reads instructions in APL. 


3.1 The Computer Reads from the Right 


Computer Reads 
From Right to Left 


In order to compute a mean, we need an array of numbers and a knowledge 
of how many elements (numbers) it contains. Suppose we have the array 
(1 23 4), which obviously has four elements in it, and we want to calculate 
its mean. Mathematically, the operation can be written as: 


(1+2+3 + 4/4 = 2.5 

In APL we can enter the following statement: 
(14+24+3+4) 24 

2.5 

Great so far. But suppose we entered: 
1+24+3+424 

7 


We made another mistake! But this one is a very, very important one to 
remember. In APL a string of mathematical operations is carried out from 
right to left. Since we are accustomed to reading from left to right, you 
can see that until you are used to the idea, you can make some bad 
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mistakes. Indeed, for the next few chapters you are strongly advised to 
practice reading all the computer statements from right to left. 

Consider the first example: (1 + 2 + 3 + 4) + 4. The computer does 
the following. Starting from the right, the computer recognizes a number, 
then a function requiring two arguments, such as :, x, -, or +, then a right 
parenthesis. This parenthesis tells the computer to Keep going to the left 
until it encounters a matching left parenthesis; then whatever is contained 
between the left and right parentheses is to be divided by four. Within the 
parentheses the computer recognizes a 4, the function, +, and then the 
number 3. It performs the operation 3 + 4 and stores the result. Proceed- 
ing to the left, it recognizes another function symbol, +, followed by 
another number, 2, so it adds 2 to (3 + 4), and so on. 

All of this is simple enough, so let us try a trickier example. Do this one 
by hand first and then check your result on the computer. 


1+2-3-4+5-6-7 
10 


If you got ~12 instead of 10, then that is exactly what we wish to explain. If 
we add parentheses, the above expression can be written as 


1+(2-(3-(4+(5-(6-7))))) 


10 
In case you haven’t got it yet, the following table should help: 


Operation 

Number Operation Result 
1 6-7 “4 
2 5-1 6 
3 4+6 10 
4 3-10 “7 
5 2-7 g 
6 1+9 10 


3.2 Two Arguments or One? 


A few paragraphs back we said that the functions +, x, -, and + require two 
arguments, but in Chapter 2 we successfully used the + and — functions 
with only one argument, provided that the argument was on the right of the 
function, not the left. At the moment all of this may be confusing, but it 
won't be after we show you how useful it is to have a function that can take 
either one or two arguments. 
First, a little terminology in case you dip into an APL manual or talk to a 
programmer friend: functions that take two arguments are said to be 
Monadic Functions dyadic, and those that have one argument are monadic; 1+2 is a dyadic use 
Dyadic Functions of +, +2 is monadic. The symbols for most functions are used to represent 


Reciprocal + 


SYNTAX ERROR 
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both an operator that is dyadic and one that is monadic—two functions for 
the price of one symbol! 
For example, the function + can be used in two ways: 


Monadic function: Symbol: +, function: reciprocal 
Example: +2 


iD 


Dyadic function: Symbol: +, function: division 
Example: 4:2 


2 


In the first case the symbol + indicates the reciprocal (or 1 + argument); in 
the second case the symbol + indicates the operation of division (ar- 
gument, divided by argument,). 

The symbol — represents two functions—the monadic function of arith- 
metic negation (more simply, ‘‘changes the sign’’) and the dyadic function 
of subtraction. The symbol + represents addition in its dyadic form; in its 
monadic form it preserves the identity of the argument, i.e., + number 
returns the number itself. The symbol x is used for both the dyadic func- 
tion of multiplication and for the monadic function ‘‘signum,”’ which will 
be mentioned later. 

Recall that with monadic uses the function comes first, then the argu- 
ment. A number followed by a function and nothing else gives a 
SYNTAX ERROR. 

In using symbols that can represent different functions depending on 
whether they are being used monadically or dyadically, remember to read 
from the right. After a little more practice in the exercises you will soon 
find no difficulty in distinguishing monadic from dyadic uses of functions. 


3.3 Variables and Assignment 


Assignment 


If we want the mean of the array 1, 2, ~3, ~4, 5, ~6, -7, what should we do? 
Typing out (1 + 2 —- 3 —- 4+ 5 — 6 — 7) + 7 is incorrect; try it and you 
will see. (Remember to read from the right, performing each function in turn 
and storing the result.) Well, there is a very easy solution, but first we will 
find it useful to give arrays and scalars (a scalar is a single number) names, 
so that when using the array we can refer to the name instead of writing out 
the whole array each time. This procedure is called ‘‘assignment.’’ As- 
signment uses the key next to the P key. (Do not confuse this key with the 
shift control keys, which also have arrows on them. The latter keys are 
used for editing by moving text to the right or left, up or down. On the IBM 
5120 they are located next to the ATTN key; on other keyboards they are 
usually on the right next to the number keys or the numeric key pad.) Type 
out 


X41 42, 34 4avy 6, 7 
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Variable Names 
Valid/Invalid 


Results can be lost 
when you log-off 
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and hit the EXECUTE key. Nothing seems to happen. Try typing X and 
hit the EXECUTE key: 


Xx 


To OE By 
Success! We now have the array we want stored in the computer with a 
name, xX. ‘‘Executing’’ X tells the computer to print out or display X. Try 
N+7 
N 


Letters of the alphabet together with numbers, but only after the first 
character, can be used to define names of arrays or scalars. Special sym- 
bols for operations, spaces, punctuation marks, and so on cannot be used. 
Some examples of valid and invalid names are: 


Valid Variable Names Invalid Variable Names 
A 3A 
ABLE + 
B3C1 “A 
Z At 
Z (B) 
A_OR_B +C 
Ax 


Z is created by typing Z, backspacing once, and hitting the upper shift 
key. Z and Z are different names. A_O#_B is keyed by typing upper shift F 
for _. Keying in an invalid variable name with assignment produces a 
syntax error. 

An important question arises at this point. If someone defines a number 
of variable names by assigning values to them, what happens when he 
signs off the computer or turns off the power on his minicomputer? As one 
might suspect, all is lost! However, we will learn in Chapter 7 how to save 
important material for use at a later time. For now, remember that if you 
log-off after having assigned values to variables, the variables will not be 
defined when you log-on next time. 


3.4 A System Command: ) VARS 


System Command 


An aid in this regard is the system command )VARS. First, we have to 
define a system command. This is an instruction to the computer concern- 
ing the manner in which it carries out your APL instructions; system 
commands are rather like sending instructions to an operator who is keep- 
ing a constant record of all that you do on the computer. System com- 
mands are easily recognized; they all start with a ), a right parenthesis. 


)VARS 
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The use of ) VARS will illustrate the idea. Suppose, after a long session on 
the computer, you have forgotten which variables you have defined. An 
obvious idea is to ask the computer what variables you have used. But it is 
clear that we need some way to make sure the computer knows we are 
asking a question about the system and how it is operating, and that we are 
not making another statement in our calculations. In APL, the distinction is 
very simple: system commands begin with a right parenthesis, ), which is 
keyed as upper case ]. For example, typing 


)VARS 
N Xx 


instructs the computer to give us a list of the variable names we have 
defined so far. The computer responds with W and x. 


3.5 How to Calculate a Mean 


Reduction / 


We have now defined by assignment two variable names, X and N: an array 
X and the number of elements, N, in_X. This is all that we need to calculate 
a mean. The calculation is easy. Key in 


(+/X)4N 
“41.7143 


and we have indeed obtained the mean. But how? Let us try this again. 
Key in 


¥+2 468426 
(+/Y)tN 
4.5744 


Apparently the symbols +/, when applied to an array, add up the ele- 
ments of the array. Mathematically, for an N-element array X this is X, + 
X,+X3+: + + +Xy, or, more compactly, ¥ .,X;. The symbol / represents 
an operation on arrays called reduction, and reduction can be used with a 
large number of mathematical functions including +, -, x, and +. Let f 
represent one of the arithmetic functions. Then (f/array) tells the com- 
puter to insert the function f between each element of the array and then 
perform all functions, but remember that it does so from right to left! Thus 
+/Y produces (i.e., is equivalent to)2+4+6+8+4+2+6. 

As another example, suppose L is a variable name of an array with three 
elements which repres2nt the dimensions of a box, and you want to calcu- 
late the volume of the box. In APL, this problem is solved by typing x/Z. 
For example: 


[<3 2 5 
x/L 
30 
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Let us return to calculating the mean. It would be most convenient if we 
did not have to count the number of elements in an array. Why not have 
the computer do it? Why not indeed! For this we use a little symbol called 
the shape function, 0, which is the upper shift R key. Let’s try it. Type 


ox 
4 

0(142,3,4) 
ub 

Ple2ys et 
u 


So the argument of the function shape, p, can be a variable name or an 
array, and the result is the number of elements in the array. What about the 
shape (length) of the variable N, which is a scalar? Typing pW, for example, 
produces no response since a scalar has no dimension associated with it. 
As we will see, a scalar and an array with one element in it are different 
animals. 

When we calculated the mean of the array X, we remembered that the 
computer reads APL statements from right to left, so that writing (+/X) +N 
meant that the elements of X were added together and then divided by N. 
What would have happened if we had written +/+? Each element of X 
would have been divided by N, and then the array of results summed. Both 
mathematical procedures theoretically give the same answers, but the 
second method is both slower and less computationally accurate if N is 
very large. 

Let us review what we have learned about computing the mean of an 
array of numbers. Suppose you are given the array X. That is, X is in the 
computer ready for you to use, but you know nothing else about it. Prob- 
lem: calculate the mean and find out how many elements there are in X. 
Here is one solution: 


N<oXx 
M<(+/X)#N 
N 


M 
eee ec 
One thing to notice about the above is that: 


(a) if you perform a function and assign the result to a variable, the result 
is stored under the variable’s name and nothing is printed or displayed 
until you execute the name of the variable; 


Scan \ 


3.6 Two Other Measures of Central Tendency 21 


(b) if you perform a function and do not store the result, it will be dis- 
played immediately; 


(c) the values assigned by you to N and M will remain in the computer 
until you log-off or you redefine the variable name. For example: 


N 

7 
N<9 
N 

9 


Do you remember all the variables you have defined? Type in the system 
command )VARS and see if you are right—the computer knows! 

What should we do if we would like an array of the partial sums (some- 
times called running totals) or partial means of X? That is, suppose we 
want the array 


1 (442), (142473) 4 (1424734 4) g (14247 3+ Ut5). ong 


(14247 3+ 445+ 6+ 7) 


This is obtained by the symbols plus scan: \. The operation scan works in 
a manner similar to reduction except that after inserting the function ‘‘f”’ 
between each element of the array, the first element is kept, then the first 
pair of elements are reduced from right to left, then the first three, and so 
on. Try 


+\X 

130 41 5-12 
+\¥ 

2 6 12 20 24 26 32 


3.6 Two Other Measures of Central Tendency: The Geometric and Harmonic Means 


Geometric Mean 


The geometric mean of N values is the Nth root of their product. Mathe- 
matically, one has 


B= (XX X_ X Agr K xy) 


N UN 
+=(fhs 
1 


How might we get the computer to calculate the geometric means of the 
arrays X and Y defined above? We have to learn some new functions 
first. 


or 
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Raising a number to a power, taking logs, and related functions are 
computed as shown in Table 1. The mathematical function is given on the 
left, the corresponding general computer programming statement is given 
in the middle, examples are shown on the right, and the keying of the 
symbols is shown below the table. 


Logarithm and Note that both « and @ can be used as monadic (single argument) or 
Exponential * dyadic (two argument) functions. The first and third rows show the dyadic 
Functions uses and the second and fourth rows the monadic ones. 

Table 1 


Exponential and Logarithmic Functions 


Mathematical APL 
Statement Statement Examples 


e® (e = 2.7183) 


log ,A 
(log of A_to base B) 
log. A (or In A) 


M is Monadic, D is Dyadic. 
* is typed as upper shift P key. 
gis typed as upper shift P key, backspace, and upper shift O, to the left of P, not the zero key. 


Logarithm and * and @ are inverse functions of each other. For example, 
Exponential Functions 
@® * neo 
3 
383%*2 
2 


With the above functions we can now compute the geometric mean of an 
array of numbers. The geometric mean for the array DATA is: 


DATA+1.1 1.2 1.3 1.4 1.5 1.6 1.7 


N 
7 
G«(x/DATA )*1+N 
G 
1.3855 
and 
G<(x/Y)*tN 
G 


4.0679 


Harmonic Mean 
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In the former example, multiplicative reduction on DATA yields a result 
equivalent to the mathematical statement II*_, D;, where D; is the ith ele- 
ment in the array DATA. The remainder of the expression produces the Nth 
root of the product. 

The second example illustrates a practical use of the monadic function + 
that we discussed earlier, namely the inverse or reciprocal. 

The harmonic mean is the reciprocal of the arithmetic mean of the recip- 
rocals. Mathematically, 


. 
is n/(z (1x) 
i=1 
In APL, this is simply 
H+N+(4+/+4X) 


H+N+(+/4Y) 
H 


3.5745 


3.7 Sample Variance and Standard Deviation 


Sample Variance 


Parentheses 


Calculating means presents us with few difficulties. What about calculating 
the variance and its square root, the standard deviation? The mathematical 
formula for the sample variance is simple enough. 


(x; = x)P/(N ™ 1), 


Me 


i=1 


where N is the number of observations N; and x is their arithmetic mean. 

If we know x, the solution is apparent. Consider the following APL 
expression, which is a series of functions linked together to make up the 
APL equivalent of a mathematical expression. (DO NOT TYPE IT IN 
YET!) 


(x/((X-M)*2))4N-1 


The above expression was obtained by the following line of thought. Let 4 
represent the arithmetic mean x. Then the expression =(x,; — x)? is in APL 
+/(X-M)*2 ; the term inside the parentheses is anarray x, — X,x. —X,..., 
xX, — x, each term of which is squared, and then plus reduction is per- 
formed on the resulting array. Remember that the computer reads APL 
statements from right to left, and expressions in parentheses are evaluated 
as soon as they are encountered by the computer. In the above expression 
the array (X-M) is calculated first, then each element is squared. With a 
number of pairs of parentheses embedded in each other as above, the 
expression within the innermost parentheses is evaluated first, then the 
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Deviation 
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expression within the next outside ones, and so on. The resulting array is 
plus reduced (i.e., the elements of the array are added), and finally, the 
summed array is divided by (W-1). 

Now we are ready to try out our expression. First type 


) VARS 
G H L M N Xx Y 


just to make sure we still have and X stored in the computer. If you do not 
get V and X listed, then you probably signed off after you last used those 
variables. If that is the case, enter them into the computer again (X is given 
on page 18 and ¥ is obtained by 1px). Now type 


M<(+/X)+N 
Ve(4+/((X-M)*2))tN-4 
SD<VR.5 


19.905 
SD 


4u.4H615 


If you did not get the same results, check first to see if the mean value is 
the same. If it is not, your X array may not match that shown on page 18, or 
the value of may be incorrect. If Y is wrong while / is right, check your 
APL expression very carefully to make sure that it is exactly like the one 
shown above. 

One little hint about keeping parentheses properly paired up: going from 
right to left, add 1 each time you hit a right parenthesis and subtract 1 each 
time you hit a left parenthesis; when you are out of parentheses, the an- 
swer should be zero, because the number of right and left parentheses 
should be equal. If they are not, find the missing or extra parenthesis. For 
example, 


Ve(t+/(CX-M)*2)4N-1 
t AA t 4 
-t 01 2 1 


The count ends at —1, so we have either a missing right parenthesis or an 
extra left parenthesis. To find out which, go to the innermost pair of par- 
entheses and work outwards in both directions. Thus 


(X-M) looks alright 
((X-M)*2) looks alright 
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(+/((X-M)*2) here is the error, a 
+ missing right parenthesis 


If instead we were to delete the first left parenthesis, we would get the 
‘‘right answer,’’ but in a very inefficient manner. In the latter case the 
squared elements of the array (X-M) would each be divided by (W-1) and 
the quotients added. In the original expression, the squared elements are 
added and then the sum is divided by (4-1) once. 


3.8 Correcting Typing Errors 


In keying the above APL expressions you may have made some typing 
errors—a common error is to have too many or too few parentheses. So far 
it has been easy enough to hit RETURN, get some error message, and redo 
the expression. However, you can see that as your APL expressions get 
longer, this will become a nuisance, so let’s see how to correct a line while 
it is being typed, that is, before hitting the EXECUTE key. Backspace 
until the cursor on the terminal head (a little device that indicates where 
the next character will be typed) is at the beginning of your first mistake 
(i.e., everything to the /eft of the cursor is correct), then hit the ATTN 
(Attention) key. Now type the remaining part of the line. Alternatively, hit 
the ‘‘line feed’’ key on the right-hand side of the terminal. For example, 
suppose that you are working at a ‘‘hard copy’’ terminal (that is, one that 
prints on paper), that you have typed 


Ve (+/ ((X-'M)*2¢N-1 


and that you realize your error before hitting the EXECUTE key. 
Backspace to the division sign, hit ‘‘line feed,’’ which advances the paper 
one row (and tells the computer to add the new characters to the previous 
line), and then complete the line correctly. You will have 


Ve (+/((X-M)*2)+N-1 
)#N-2 


and the computer will correct your error as soon as you hit EXECUTE. 

Editing lines on the IBM 5120 and many other CRT™* terminals is even 
easier. You can simply backspace and type in the correct characters. Some 
terminals have the ability to insert characters within a line. You space back 
until you reach the last correct character, hold down a special key (‘‘com- 
mand’’ on the 5120 series), and press the right arrow on the top row of 
keys.** The result is 


V<(+/((X-M)*2)g 2M-1 


* Cathode Ray Tube—electronics jargon for a television screen. 


** This is true on terminals that have an addressable cursor. For others, the correction process is more 
elaborate. In some cases, each character may need to be erased. In others it may be easier to just 
replace the whole line. 
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In effect, you moved the four characters +¥-1 one space to the nght and 
held the cursor at its original point. You now type the missing ‘‘)’’: 


Ve(+/((X-M)*2) ) 2-1 


The procedure used for editing lines is specific to the computer system 
you are using and also to the particular terminal interfaced to that system. 
CRT’s generally provide the most flexibility, but having a written or hard 
copy of your session is often extremely valuable. You will have to consult 
the computer center personnel for specific editing procedures, as such 
procedures are not explicitly part of the APL language. 

If we refer again to the APL statements on page 24, we notice that the 
three lines of statements that calculate M, V, and SD must be executed in 
precisely the order shown. This is because the second line needs the result 
of the first, and the third needs the result of the second. We are beginning 
to discover that we will have to develop tools more powerful than those 
that we have used so far. This will be the subject of the next chapter. 
Meanwhile, we will conclude this chapter with a way of calculating sample 
means (arithmetic) and variances from sample probabilities of success 
(see, for example, Kmenta, Chapter 2). 


3.9 Mean and Variance of Sample Probabilities 


Suppose we are interested in estimating the probability of getting a seven 
when we roll a pair of dice. (Of course, it is easy to see that if we have 
clean, unloaded dice, the probability is 1/6, but we might want to check our 
dice.) One way to do this experimentally is to roll a pair of dice N times 
and then divide the number of successes (number of times you got a seven) 
by N. But this is merely an estimate. How might we estimate the mean and 
variance of this estimate? One way would be to repeat the above experi- 
ment a large number of times, say NN, and then to calculate the mean and 
variance of the estimates of the probability of a seven that were obtained in 
each trial. 

Suppose you have data obtained from NN = 100 replications of a dice 
tossing experiment in which N = 4 tosses were made. In any one experi- 
ment of four tosses you could obtain zero to four sevens—five possibilities 
in all. The estimated probability from each experiment could vary from 
zero (equal to 0 successes divided by 4, the number of trials), to 1 (4 
successes in 4 trials). As we suspect, if our dice are unloaded, PR, the 
probability of getting a seven is 1/6, PR = 0.166... . From each experiment 
we get an estimate, say PR, which can take one of 5 discrete values, viz., 0, 
0.25, 0.5, 0.75, and 1.0. Let PR, = 0, PR, = 0.25,..., PR; = 1.0. If NN 
equals 100, then we can count the number of times n; that we get each 
estimate PR,,i = 1,2,...,5, in 100 trials. These five numbers7,,72,..., 
n;, whose sum is 100, are called absolute frequencies. If we divide each n, 
by 100 we get five relative frequencies whose sum is 1.0. Let’s call the 
relative frequencies fr;,i = 1, 2,... , 5. fr; is merely the proportion of 
the NN repetitions of our experiment that yielded PR; as the estimated 
probability. That is, fr; = n;/NN. 
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The first thing that we must determine is the mean estimate PR. The 
mathematical statement of the answer is simple: MF = 23_, fr,PR;, where 
MF represents the mean of the sample probabilities obtained from the 
observed relative frequencies fr;. (MF is a sample mean of sampling pro- 
portions PR,;,i = 1,2,...,5.) The variance VF is given by the expression, 
VF = 35_, fr(PR,; — MF)?. 

The calculation of MF and VF in APL, although straightforward, intro- 
duces us to yet another function. Let us suppose that we have the follow- 
ing data from a sampling experiment in which NN = 100: fr, = 0.01, 
fr. = 0.06, fr; = 0.28, fr, = 0.42, fr; = 0.23. Enter the APL statements 


FR«0.01 0.06 0.28 0.42 0.23 


PR+0.0 0.25 0.50 0.75 1.0 


We now have all the data we need ready and waiting in the two arrays FR 
and PR. 

To get what we want requires an operation called the ‘‘inner product,” 
and the version we want here is typed by keying plus, period or decimal 
point, and then multiply. The expression for yF then is simply FR+. PR. 
The APL code tells the computer to take in turn each element of the array 
FR, multiply it by the corresponding element in PR, and add the products. 
The calculation of the sample mean and variance may thus be carried out 


by typing 


MF<FR+.*PR 
VF+FR+ .x (PR-MF)*2 


MF 


VF 


From our results it would seem that our dice are definitely loaded! 

Now that you have learned how to calculate some basic statistics, you 
will be anxious to try your hand at some more realistic data. If you try to 
enter somewhat more data than we have been using so far, you will run into 
a little problem. The problem is that the computer will limit the amount of 
data you can enter. The limit is usually 80, 128, or 160 characters. You get 
to the right-hand end of a line and you either cannot enter more data, or the 
new data replaces your previous entry. 

The solution to this problem is not difficult. Whenever you want to 
continue entering data on the next line, simply close off the current line by 
the symbols , and hit RETURN (or EXECUTE). The symbols are a 
comma followed by an upper case L.U is called ‘‘quad,’’ and you will be 
meeting this useful operator again. The computer will respond on the next 
line by printing U: after which you carry on entering data until finished. 
You can use this device to enter as much data as you wish. 
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Summary 
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The Computer Reads from Right to Left. 
Expressions in Innermost Pairs of Parentheses Are Executed First. 


Arrays of numbers and scalars can be given names by assignment, e.g., 
X+1 2 3 assigns the name X to the array 1 2 3. There are name limitations 
on the symbols that can be used in making up a name. Names should begin 
with a letter—after that any other alphabetic character or number is okay. 
The name can be as long as you want (only the first 77 characters will be 
recognized); don’t put blanks inside the name. 


Dyadic functions have two arguments. 


Monadic functions have one argument; the order is ‘‘f array”’ or ‘‘f 
number.’’ 


Many symbols, such as +, -, x, +, do double duty and represent both 
dyadic functions as well as monadic ones. 
System Commands are instructions about the operation of the computer. 


They are indicated by a right parenthesis: ). 
)VARS is a system command which instructs the computer to list all the 


variable names assigned by the user. 
Reduction, /, a monadic function, is used with the arithmetic functions 


on arrays: 
Pi Ma Ke oh eg he produces x,fx2fxef.. . fxn 


Shape, p, displays the number of elements in an array. 
Arithmetic Mean (of an array X): Mathematically, M = (24.,x,)/N, and 
in APL 


N+ pX 
Me (+/X)=N 
Scan, \, a monadic function, is used with the arithmetic functions on 
arrays: 
f\%1, Xo,» » » 5X produces Y,, ¥,,... , Y, where Y, = x,, Yo = x, f%e, 
Y; = Xi Sxofx3, se ey Y,, = X1fxX_fx3, cee y ae 


The exponential function, «, has a monadic and a dyadic use; see 


Table 1. 
The logarithmic function, e, has a monadic and a dyadic use; see 


Table 1. 
Geometric mean (of an array X): Mathematically, G = (II},(x))'. In 


APL, 
NW op X 
G< ( x/X)*=M 


Harmonic Mean (of an array X): Mathematically, H = N(2%,x7"')"', 
and in APL, 


H<Ne+/3X 
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Sample Variance (of an array X): Mathematically, V = 2%, (x; — M)?/ 
(N — 1), where M is the arithmetic mean. In APL, 


Ve ( +/ €C X-M)*2))4N-1 


The Sample Standard Deviation (of an array X) is the square root of the 
sample variance. 

Inner product (between two arrays X and Y of equal length): Mathemat- 
ically, JP = xyy, + Xey2 + °° * + + x,y, In APL, 


IP*«X+.xY 


Mean (arithmetic) for Sample Proportions: Mathematically, MF = =, 
Sr; PR, ; Where K is. the number of cells and fr; is the relative frequency of 
the ith value of PR;, the ith sample proportion. In APL, 


MF<FR+.x*PR 


Variance for Sample Proportions: Mathematically, VF = =, 
Jr{PR; — MF)?. In APL, 


VF«FR+.x (PR-M)*2 


How to continue entering data over more than one or two lines: use ,[ at 
the end of a line of input on the terminal. 


Chapter 3 


APL Practice 


1. Let’s explore the uses of the functions defined in this chapter. 
Let P+3 and Y+1,2,3,4. 


& & & 
(a) p P (e) p Y 
(b) p 3 () pp ¥ 


(c) pp 3 (g) +/Y and compare it carefully with +\¥ 
(d) pp P (h) (+/Y@®Y) which is the same as p Y¥. Why? 


2. Assign the values 1,2,3,4 to xX, the values wg ey Oy to W, and 
define Z by Z+«x. 


(a) 1*X (m) Xx0 

(b) X«1 (n) o@W 

(c) 2*X (0) “18x 

(d) Xx2 (p) xe1 

(e) OxX (q) @Zwhich is x 

(f) x@e0 (r) +/X+ 9 X 

(g) 7*W (s) ( +/X)t p X 

(h) xX* 1 (t) What is the difference between (r) and (s)? 
(i) 10x (u) +/XxW*x2 which is the same as X+. xW*2. 
(j) xe1 (v) -\Xand compare with -/X. 

(k) 2@X (w) -\Wand compare with -/VW. 


(1) Xe2 
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3. Evaluate the polynomials f(x) = x? — 5x + 6, f(x) = (x® — 1)(x? — 2), 
and fix) = xt — 3x? + 2 for X<"54 34 14 041,2,3. 
4. Practice the right to left rule by solving: 
(a) 10+ 3+-5 (f) -/1,2,373 
(by 793.3220 (@) ayedee 2 2 
CO) 28s ve dO (h) +\2 34 
{(d) 3-X+6-10 (i) +\10*123 
(e) +/t1 oa 
You should be able to find the answers without using the terminal and 
then use the terminal to check. 
5. Let the arrays X and Y be defined by 
ALP ait ‘ +410, Yo Q4+X*.3, 
Practice the algebra of summations by trying: 
APL Form Math Form Explanation 
(a) +/7X DMX; the sum of the reciprocals of the elements 
of X 
(b) +/X-1 DM, (X%, - I the sum of the differences X; — 1 
(c) +/X*2 Dey, Xe the sum of the squares of the elements of X 
(d) (+/X)*2 (= 2, XP? the squared sum of the X; 
(e) +/®X De, nx; the sum of the natural log of the X; (natural 
log is log to base e) 
(f) +/*X Dh, e* the sum of e raised to the powers X; 
(g) -/X DM, (- DEX; 
(h) -/-X DM, (1X, 
(i) +/XxY Thy XY; These sums occur very frequently in 
G) +/YxXx2 Dh XPY; regression analysis; see Chapter 8. 
6. Verify that the first element of X is equal to the first element of +\X, 
while the last element of +\X is equal to +/X. 
7. Here are some more summation formulae. Try expressing each in 


terms of APL functions. 

(a) DL, X; - 1 

(b) zh, e7 

(c) DL, (X; + 3)? 

(d) 2%, (-Ii1/X)) 

(e) (24, X))/N 

(f) 1/22, (1/X)) 

(g) 2, (xX? — 5)? 

(h) Show that 2%, AX; = ki, X; 

(i) Show that S4., (X; + 2)? = =%, X? + 2NZM, X; + N2? 

G) Show that 2/2, (X¥; — (2%, Xi)/N) 
= DL, X? — 1/N(@A, X;)? 
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where N is the number of elements of X. As there are several ways to 
write each of the above, the suggested procedures in the solutions may 
not always be the same as yours. You can, however, check that your 
procedure is correct by computing the numerical solutions. Use the list 
X defined in exercise number 5. 


Interpret the following APL statements in mathematical form: 
(a) +/2X (d) ++/4Xx¥ 

(b) -/:X (e) t+/ ( X-+/Xt p X)x2 

(c) -/+-X (f) ( Xx¥)- ( +/X)« ( +/Y¥) 

Which of the following are invalid names? 

(a) ABE LINCOLN 

(b) Bx 

(c) LOUIS THE 14TH 

(d) XXX+YYY 

(e) X1X2X3 


(f) 2W+3Y 
(g) IWILLBEHERETOMORROWATTHREE 


Statistical Applications 


1. 


When the elements of an array are ratios, the geometric mean may be a 
more useful measure of central location than the arithmetic mean would 
be. If an array has elements that are rates of change, then the harmonic 
mean is usually preferable. 

Consider the following data: 


U.S. Total Residential Debt Outstanding 


Year in Billions of Dollars 
1966 274.2 

1967 292.0 

1968 312.8 

1969 335.9 

1970 357.8 

1971 398.0 

1972 454.5 

1973 509.8 

1974 549.8 

1975 593.0 

1976 655.0 

1977 711.2 (estimated) 


Source: U.S. League of Savings Association Publications #24, 1977, 
p. 28. 


(a) Find the arithmetic mean of U.S. residential debt outstanding 
during the 12 years. 
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(b) Find the ratio of each year’s debt to the previous year’s. 
(c) What is the geometric mean of these ratios? 


(d) Find the percentage increase of the debt series from each year to 
the next. (This is equal to the ratios in (b) minus 1.) 


(e) What is the harmonic mean of the percentage increases? 
(f) What is the mean rate of growth in U.S. residential debt? 
(g) Explain intuitively your answers to (a), (c), and (f). 
2. In the following table CE represents the number of cracked eggs in a 


carton (each carton contains 12 eggs). CN is the number of cartons out 
of a sample of 60 cartons, randomly selected from a shipment of 2,000, 


that contain cracked eggs. 


— | 0 1 2 3 4 5 6 7 8 9 10 it 12 
0 5 7 Il 16 8 4 5 2 l 0 0 1 


(a) What is the average number of cracked eggs in each carton? 

(b) What percentage of cartons have fewer than 2 cracked eggs? 

(c) On the basis of your answers to (a) and (b), shouid the shipment be 
accepted? (A shipment is acceptable when 8% or less of the eggs 
are cracked.) 


3. Let Wi = Ty . . ,5, take the values 3,4,7,5, 11, and Y; = 3 a 2W;,. 
Find W, Y, Sty = (224(Wi — W)?)/4,  S} = (2? ¥i — ¥)?)/4, 
Sw = VSi, Sy = VS%. Verify that Y=3+2W, S? = 45%,, and 
Sy = 2Sy. 


4. Letr represent a list of estimates of the interest rate for next year: 5%, 
6%, 7%, 8%, 9%, 10%. Suppose we believe that the respective prob- 
abilities that these values will occur are 0.1, 0.2, 0.3, 0.2, 0.1, 0.1. Find 
the expected value of the interest rate. What is the probability that the 
interest rate will neither fall below 6% nor exceed 9%? (Note: the 
expected value of a variable X which can take on only discrete values 
is defined by 2}_, X,;P;, where P; is the probability that X; will occur.) 


5. Consider a gamble wherein a fair coin is repeatedly tossed until a head 
turns up. Ifa head is obtained on the first toss the payoff is $2; it is $4 if 
a head is obtained on the second toss, $8 on the third toss, and so on. 
Use the computer to find the expected return if the coin is tossed at 
most a total of one hundred times. (A new wager is made after each 
time that a head appears.) 
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6. The following measurements (in dollars rounded to the nearest integer) 
represent the increase (if positive) or decrease (if negative) in the daily 
closing price of General Motors and General Electric common stocks 
for 10 consecutive days. 


GM 2 | 3 0 0 1 “1 2 4 0 


GE 3 4 “1 0 “1 “2 0 4 3 “2 


(a) Which stock would you prefer with respect to average daily re- 
turn? 

(b) If the larger the variance of a security the larger the risk, which is 
the riskier security? 


7. Imagine yourself sitting at the roulette table in Las Vegas, having lost 
the family fortune, and being left with only $200 at your disposal. The 
roulette wheel will be spun about seventy more times before the table 
is closed. There are 36 numbers and a double zero on the wheel. You 
decide to bet $3 on the number 11 repeatedly. The rule of the game 
is as follows: If 11 comes up you get $105 ($3 x 35), the payoff 
is $35 for a $1 bet; if any other number comes up you lose the $3. 
What is your expected cash position at the end of the night? 


How to Write 
Your Own Function 


In this chapter, after defining a few more APL expressions, we introduce 
the important concept of a function which you can write yourself. At the 
end of this chapter your ability to apply APL will have taken a big leap 
forward. Let’s press on. 


4.1 The Sample Median 


Sample Median 


Residue, | 
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One measure of central tendency that is in widespread use is the median. 
The median is that value for which half of the sample values are less than 
or equal to it and half are greater than or equal to it. The median for n 
observations is defined mathematically by 


_ | ae + Xne+)/2 ifn is even 
Ma [ome 1)/2 +1 ifn is odd (4.1) 


where x, =x. =... x, are the order statistics obtained from n ob- 
servations. That is, the x observations are reordered so that the smallest 
observation is first (x,) and the largest is last (x,). 

Calculating M in APL might seem to be a formidable task, but in fact it is 
quite simple. In addition, calculating the median will introduce us to some 
useful programming tools. 

We have two problems to solve—discovering whether n is even or not, 
and reordering the array to get the order statistics. That is, we want an 
array with the smallest observed number first, the largest last, and with 
each number less than or equal to the number on its right. Let us begin with 
some useful new APL tools. 


The Residue Function 


The first of these tools is the residue function, |, which is upper shift 7. 
The residue function applied to two numbers A and B, denoted by 41[B, 


Absolute Value, | 
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yields the A residue of 2, and is the ‘‘remainder’’ left over after dividing A 
into 8. For example, the 2 residue of 3 is 1, the 3 residue of 8 is 2, the 4 
residue of 8 is 0, and so on. Of particular interest is the 1 residue of a 
positive decimal number; thus 


1[2.5,0.6,4.0 
0.5 0.6 0 


In other words, the | residue of a positive decimal number is simply the 
decimal fraction of that number. Let’s try some more examples: 


30123456789 10 
01201201201 

10| 11,102,1032,11021 
4-99 4 

1] “6.0, 6.4, 0.3 
0 0.6 0.7 

213,4.0, 3.0,6.2, 6.2, 4.0 
1010.2 1.80 

3[4.0, H7O0,8 0, S00,4.25 42, 48 


D2 gods 2, By de? 


The results for the first two arrays are clear enough, but what about the 
others? Close inspection reveals a difficulty in interpretation of the results 
only when we try to get the residue of a negative number. What is occurring 
with negative numbers will become clear as soon as we understand the 
residue operation with positive numbers. 

If we look back at the first example we notice a recurring sequence 0 1 2. 
Suppose that we extend the array to include negative numbers, say 


3/ “10 “9 “8 7 “6 5 4° 3°72 °101423456789 10 


20420142014201212014201201 


Note that the pattern is exactly the same. 

However, 3|4 is 1 and 3| 4 is 2, so the result is not simply the residue of 
the absolute value of the righthand list. In fact the monadic use of the | 
symbol is the absolute value function. For example 


{74 


[SOc SOAs? Bs 4 


2.1,1,0,1-2,3.1 
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Absolute Value and 
Residue to Compute 
Fractional Parts of 
Positive & Negative 
Numbers 


Logic of Residue 
Function 


Statement 3|~10 


Result 
Index 


Residue = 2 
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So we could find the fractional parts of elements of a vector that had both 
positive and negative elements by 

1{|~2.1 “101.2 3.4 
O. 1,9 2° go 2 Po 1 
Remember, we proceed from right to left, first finding the absolute values 
of the array by using the monadic | function and then using the dyadic | 
function to find the fractional parts. 

While we have shown that the residue is consistent in its operation on 
positive and negative numbers, we need to explain the logic behind this 
consistency. Here is one way to think about it. The residue for a positive 
righthand argument is obtained by successively subtracting the lefthand 
value until the result is less than the lefthand value. If we have 3|7 we 
subtract 3 from 7, yielding4, then subtract 3 from4, leaving 1, which is less 
than 3 so we stop. The last value is the residue. When the righthand value 
is negative we add the lefthand value to the right until the result is positive. 
Soifwehave 3]! 7, weadd 3to 7, yielding “4, then add 3 to “4, leaving 
“1, then add 3 to “1 to get 2. This process defines the residue function for 
positive lefthand and negative righthand values. 

Finally, here is a diagram of the results of the first example: 


O 1 2 0414 2 014 2 04 2 014 2 04 2 0 =¢42 
2 3 4 5 6 7 8 910 11 12 13 14 15 16 17 18 19 20 21 


Diagram 1 Illustrating the Value of the Residue When Dividing Numbers by Three* 


“12-11-10 9 -8 -7 6 “S$ 4 3 72 71 9 10 IT 12 


Arrays: Parity 


* Right argument of residue on x axis, residue on y axis, numbered points are the index values (see 
above). Locate 4 on the bottom of the diagram, move vertically until you cross the diagonal line, read 
the residual value 1! on the left. Check to see that it is element 15. 


Parity of the Number of Elements in an Array 


After this lengthy digression we can take the first step in finding the 
median. Does the array have an even or odd number of elements? If you 


Ordering Arrays 


Indexing Arrays 
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have re-logged-on to do this chapter, you will have lost the X and Y arrays 
defined in Chapter 3. If so, please enter these values first: 


Bl eee 3y 4e Ig Og 7 


Ye bye e4 26 


Solution: 
N+ op X 
2\N 


If the result is 0 we have an even number of elements (W is divisible by 2 
with no remainder), and if the result is 1 we have an odd number. In our 
example VV is 7, SO we have an odd number and the result of 2|¥V is 1. You 
may be wondering why we bother with all of this, since printing N tells us 
immediately if N is even or odd. The reason is that we are laying the 
foundation for the computer to make the decision itself, without our inter- 
vention. 


Ordering the Elements of an Array 


The next problem is to reorder the array X so that the smallest element is 
first and the largest is last. To be safe and not lose the original order, let us 
store the reordered array X in a new array called #. The process of reorder- 
ing an array introduces two new concepts, the grade up (or down) function 
and the simple mysteries of indexing. Let us first consider the indexing of 
arrays and the indexing function. 

Since a one-dimensional array is merely a list of elements, it seems 
natural to give the position of each element in the list an index value and to 
let that value be the position of the element in the list; in short, the first 
element going from left to right is element number 1, the second is number 
2, and so on. APL lets us refer to the elements in an array in a simple 
manner. Type out the following: 


W910 .3,8,12,0,1,5 
W 


9 T00 3 28 A220 T eS 


wl1] 
) 

WLS] 
12 

Wh 23] 
9 10 3 


W{3,1,21] 


38 


INDEX ERROR 


Grade Up & 
Grade Down 
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3; 9-10 
W 
9. TO" 38) 12°09 <1) 5 


The examples illustrate what is happening. The indexing of an array allows 


‘you to select from an array the individual elements of an array in any 


order. In addition, specific elements in an array can be replaced very 
easily; for example, 


WI 4 


Wh1, 416,15 
W 
G40 S15 da as 


The contents of [ ] can be any valid APL expression, provided only that it 
results in an array of integers, none of which is larger than pW, the number 
of elements in the array W. Try 


WEO] 
INDEX ERROR 
W{0J 
A 
WL2 2 2) 
10 10 10 
WLS J 
INDEX ERROR 
WI9 J 
A 
WE 2] 
INDEX ERROR 
we 2) 
A 


Now consider the functions grade up, 4, and grade down, Y. Each is 
formed by typing one character on top of another (the | we used for 
residue (upper shift ”), backspace, and then upper shift 4 for A (delta) or G 
for V (del)). Let’s try it on the array / that we defined above: 


AW 
67381254 


Sorting an Array 


Sample Median 
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and 
VW 
452148376 


To see what is happening here, write down the elements of ¥ with their 
indices underneath them: 


Ww: 6 10 3 15 1 


2 0 
index: 1 2 3 4 5 6 


1 5 
7 8 
We see that 4 (grade up) gives us a new array in which the first element is 
the index value of the smallest element of W, the second element is the 
index value of the second smallest element of W, and so on. Grade down 
(Y) also indicates the order of the array, except that the first index value in 
the new array is that of the largest element in W, the next is the index value 
of the second largest element of W, etc. The last index value in the new 
array is the position of the smallest element of W. 

You might be able to guess what we should do now in order to get an 
array with the elements of Wor X or Y listed in either ascending or descend- 
ing order. Try 


WUP<W AW | 
WUP 

Ot Boba 6: 10 Ags 15 
WDOWN+W YW) 
WDOWN 

ae aap iG ass ec TO 


This is all you need to sort any list of numbers. The grade function orders 
the index values inside the square brackets, and then the elements rep- 
resented by those positions are selected. 


Calculation of the Sample Median 


Now we can calculate the median. If our expression for discovering the 
parity of N tells us that N is odd, then by Eq. (4.1) the median value of the 
elements of X is element number ((N — 1)/2) + 1 of the ordered array. (It 
is irrelevant here whether the elements of X are arranged in ascending or 
descending order.) Let’s try it, but first let’s recall what X and N are. 


Catenate, 


3 
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XUP*<XL AX] < rearranges X in ascending order 
and stores result in XUP 


M<XUP[1+ (N-1)#2] < picks out the middle element 


of x 
M UP 


3 


If V is even, the median is: 
W 


6 10 315 12015 


WUP<W[ AW] 

K+ (p W)#2 

MeWUPLK ,K+1] <— picks out the middle pair of 
elements 

Me ( +/M)+2 

M <— averages them 


You can easily check by visual inspection that both calculations are 
correct. Each of these two sets of APL statements can be called a ‘‘rou- 
tine’’; a routine is a group of APL statements that perform some operation. 

These two APL routines worked, but the third line in the second routine 
contains something strange—the contents of [ ] must be an array of inte- 
gers, but what is the comma doing there? Why does WUP[K,K + 1] work? 
First, ask yourself what you would get if you typed inK,K + 1, where K 
has the value 4. If you incorrectly read from right to left, you might believe 
that the answer would be 5 5. You would be incorrect in believing that K 
K + 1 is computed as ‘‘to the array K,K add 1.’’ APL expressions are 
computed from right to left. But we cannot separate K and K + 1 by 
parentheses alone, e.g., K (K + 1), since APL expects a function to ap- 
pear between K and (K + 1). The function represented by “‘,’’ is discussed 
next. 


The Catenate Function 


The solution to our little problem introduces a very useful concept and a 
useful function, the catenate function ‘‘ ?’, which is keyed in by typing a 
comma. The catenate function enables us to extend an array or to make 
arrays out of scalars. For example, let A and B denote arrays of length p 


and q, respectively, and let K and L be scalars. Then 
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A,B produces an array of length (p + g) with the A elements 
coming first, 
B,A_ produces an array of the same length, but the B elements 
come first, 
A,K produces an array of length (p + 1), 
K,L produces an array of length 2, 
,L_ produces an array of length 1. 


So the solution to our problem is to create the array K,K + 1, a two- 
element array of which the first element is K and the second is K + 1. 

We have now solved the problem of computing the median but, espe- 
cially if N is even, it is a nuisance to type out all these statements each time 
we want the median. What would be convenient would be a function 
called, say, MD, such that typing MD,X (where X is an array) produces the 
median of X. In short, we need a monadic function just like the ones 
provided by APL, like 4, o, |, and so on. To meet this need, APL provides 
a method whereby the user can supplement the set of primitive APL func- 
tions (those available from the keyboard, such as *, +, etc.) with user- 
defined functions. Once the user function is defined, it can be used over 
and over again just by calling it—for example, by typing “D,X, where X is 
an array, the median of which is defined by the function called mp. 


4.2 Function Definition 


Entering Function 
Definition Mode 
V 


Niladic Function 


Our first step is to tell the computer that we want to define a function to 
be used later. This is done by typing V (called ‘‘del’’), which is the upper 
shift G key. We will also need to inform the computer when we are finished 
defining the new function. That is done by typing in V again. Thus the 
computer expects that between the two V symbols it will be receiving 
instructions that will define a new function. 

Certain rules must be followed when defining functions, and we will 
consider these rules now. If we type V, which tells the computer *‘ Function 
definition coming up,”’ our next action must be to give the function a name. 
Once we have named our function, both we and the computer can refer to 
it even while we are in the process of defining it. But in APL we not only 
name a function, we can also give it some arguments—that is, something 
for the function to operate on. 


Niladic Functions 


In §3.2 we defined monadic and dyadic functions—functions with one 
and two arguments, respectively. We can also have niladic functions, 
which do not have any arguments. A niladic function is called simply by 
typing V and the name of the function and hitting the EXECUTE or RE- 
TURN key. It then carries out the procedure specified in the function— 
you do not give it any arguments to work on. Let’s consider an example. 
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)CLEAR 


Roll ? 


Correcting 
Typing Mistakes 
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Suppose we want to run some experiments on the computer to check out 
the statistical properties of the probability estimator PR discussed at the 
end of the last chapter. One of the first things we have to be able to do is to 
generate data similar to that which we would get from rolling a pair of dice. 
If N, the sample size, is 16 or 20, and NN, the number of repetitions of the 
experiment, is 100 or more, and if we want to see how the mean and 
variance change as we alter N, then we are facing a very large amount of 
dice throwing. Such an activity may not be your favorite way of spending a 
Sunday afternoon in the sun, but if it is pouring rain, go ahead and roll the 
dice. Fortunately instead of rolling dice we can use the computer to simu- 
late the process of actually tossing them. So, let’s generate some data. 

We begin by defining a function that will give us one roll of a pair of dice. 
Before beginning to define our own functions, let us clear the decks, so to 
speak, of any other nonprimitive functions that might be already in our 
‘*workspace’’ and might give us some difficulties and strange responses. 
We do this by typing )CLEAR, and the computer responds by telling us that 
our workspace (you can think of it as the part of the computer that we are 
using) is clear. WS stands for workspace. You will recall that the symbol ) 
indicates that a system command is coming up. Type 


)CLEAR 
CLEAR WS 
VR«DICE 


C1] 


The use of ¥ in the first line is clear enough; it tells the computer that we 
are defining a function. In more technical language, it puts us into ‘‘func- 
tion definition mode’’ instead of ‘‘execution mode.’’ The word DICE is the 
name of our function, but why 2? We will explain that one in a minute. 

The computer responded with a [1]. This is its way of telling you that it 
understands that you are defining a function, and [1] is the number of the 
first statement for you to write. The computer now expects you to write an 
APL statement, so let’s oblige. We will be using a special APL symbol, ?, 
called roll, which is upper case Q. Be careful not to get confused. Some 
terminals have two ? symbols, one for APL and one for non-APL use. If 
you are unsure about which is correct, just experiment—you can’t hurt a 
thing. Starting directly to the right of [1], type in 


[1] R<+/?6 26 


and then hit (but only after checking line [1] very carefully to see if it is 
correct) EXECUTE. The computer will respond with [2] and you typev 


[2] Vv 


Then hit EXECUTE (or RETURN on some terminals) again. 
If you made a mistake on line [1], and you noticed it after pressing 
EXECUTE, then you can correct it very easily by typing in on line [2] 


[2]J[1] R«+/?76 6 


Generating Random 
Numbers 


Function 
Headers 


4.2 The Function Definition 43 


In short, you repeat the line number of the incorrect line and type in what it 
should be. 
We have defined our function, but what is it? Type 


DICE 
6 

DICE 
7 

DICE 
3 


Generating Random Numbers 


By now you have guessed that DICE is a function designed to give you 
the result of rolling a pair of dice. The basic element in the function is the 
primitive function roll, for which the symbol is ?. Typing 


76 
5 


produces arandom number between 1 and 6, typing ?10, a random number 
between 1 and 10, and in general ?N produces a random number between 1 
and N. The probability of any one specific number showing up is 1/N. This 
is the monadic use of the aptly named roll function. Now we see why ? 6,6 
produces two numbers from 1 to 6, each of which is equally likely to occur; 
this is the computer’s simulation of rolling a pair of dice. 

Our statement number [1] is now clear: ? 6 6 gives us an array of two 
random numbers (statistically independent in fact), and +/? 6 6 gives us 
the sum of the two random numbers. 


Function Headers 


We are now ready to answer the question about the function header—the 
line that defines the function. (In this case the function header is VR<DICE.) 
In line [1], instead of writing R++/? 6 6, we could have written: 
+/? 6 6. What’s the difference? In either case, calling DICE produces the 
desired result in that the computer prints out the sum of a random tossing 
of two ‘‘computer dice.”’ 

The distinction between the two statements becomes apparent when we 
consider whether we want to use the result in some other APL statement. 
For example, we might wish to calculate DICE+2. Alternatively, we might 
want a function such that when the user types #7, the computer responds 
WHAT IS YOUR NAME. Now we do not want to use such a function in any- 
thing else; we just want the computer to type out WHAT IS YOUR NAME. 


Interactive 
Programming 


Quote ' 


Monadic Function 
Headers 
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The DICE function is said to ‘return an explicit result,’’ something we 
can use elsewhere, and the #7 function does not produce an explicit result. 
If you want an explicit result, you need the header in this form: Vv (Some 
Variable Name) <« (Name of Function). Somewhere in your function you 
will produce the explicit result that you want; that result must be stored 
under the same variable name as that used in the header. This is what 
we did in DICE, If instead of using R in the header we had written 
VD«DICE, then line [1] would have to read 


[1] De+/? 6 6 


The Quote Function 


To provide a contrasting example of ano explicit result function, consider 
the function #Z mentioned above. Let’s define HZ: 


VHT 
[1] "WHAT IS YOUR NAME?'* <— quote is upper shift K 


Lado¥ 
Now try typing 
AT <— you type 
WHAT IS YOUR NAME? <— computer responds 


This example introduces another useful APL function, the quote ('), 
upper shift X. The characters between the quote signs are treated as just 
that: characters. When the computer comes across a line like [1] when in 
execution mode, it simply prints what it sees. 


Monadic Function Headers 


Let’s return to functions with arguments. First, the header of a monadic 
(single argument) function is of the form 


opens ———-———_> MDX <————_—_‘‘dummy”’ 


function we variable name 
returns an function 


explicit result name 


Why is Xin the above header to the function MD called a ‘‘dummy’”’ variable 
name? The reason is that in the header, X represents the argument that the 
function is supposed to operate on, and X appears later in the body of the 
function. But when you use the function MD on a variable V, or a variable P, 
or A, you would simply type in MD V, or MD P, or MD A. 

Remember the median? We showed how to calculate it on page 40. Now 
we know how to make it a function. First, we need to decide on a name; 


Unconditional 
Branch > 


Line Labels 


Conditional 
Branches 
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MD seems to be as good as any. Second, we need to decide whether we 
want an explicit result or not. Presumably, since the median 1s going to be 
used in some other calculations by someone at some time, it would be best 
to have it return an explicit result. Third, do we want the function to 
have arguments? Well, the answer to that is that if we have a vector, say X, 
in the computer we might want its median, and an easy way to say that is 
'MD X*, so we want a monadic function. Recall that our first task is to enable 
the computer to decide whether X has an odd or an even number of ele- 
ments in it (see page 40). But, somewhat more importantly, we want the 
computer to decide which way to calculate the median on the basis of 
whether ¥V is even or odd. In short, we want the computer to do one 
computation if V is even and to branch to another computation if WV is odd. 
As you might guess, this is a not so subtle hint that we are about to define a 
new APL operation; this one is called ‘*branching.”’ 


Unconditional Branch 


Suppose that in some APL routine you want the computer to go directly 
to line 6 if it gets to line 3. This is easily done by entering ~>6 on line 3. 


[3]+6 


The symbol > is the upper shift of the « key. This is called ‘‘unconditional 
branch.”’ In this case, the computer would know that if it ever gets to line 3 
it must go directly to line 6 and execute it next. 

Now for reasons that will become apparent in Chapter 7, it is a good idea 
to develop the habit of giving labels to lines to which the branch directs the 
computer to go. So if we label line 6 ‘‘S7X’’, for example, then the state- 
ment on line 3 could read [3]>STX. In this case line 6 would look like this: 


[6] SIX: {some APL instruction to be performed} 


The computer reads from right to left, so when it encounters the symbol : 
in a line of a routine, it knows that everything to the left of : is the label 
for this line of instruction in the routine. We have simply named the line 
SIX. It now has a position number [6], and a label or name, SIX. 

Let us now discover how to solve our problem. Recall the two ways in 
which we calculated the median (page 40). What we want to do is to go to 
one set of instructions or the other, depending on whether JV is odd or even, 
and then to exit from the function. Let’s see if we can program the follow- 
ing steps: 


(a) determine whether JN is even or odd; 

(b) if Vis odd, continue with the ‘‘odd’’ calculation and then exit from the 
function; 

(c) if W is even, branch to the ‘‘even’’ calculation and then leave the 
function. 


Index Generator, 1 


RANK ERROR 
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Consider (but do not type anything yet!): 
VY MeMD X 

[1] Neo Xx 

[2] +ODDx12|¥ 


The last part of this line is clear enough: 2|¥ produces 0 if Vis even and 1 
if V is odd, since 2|W is the ‘‘two’’ residue of WV, but what can we make of 
the first part? Let’s look at the next section. 


The Index Generator 


The symbol , is the ‘‘index generator’’ (upper shift 7). For any integer N, 
iN produces the ‘‘index array’? 1,2,... ,N. 14 produces an array with 
only one element in it, namely 1. But what about10 ? This produces an 
‘‘array’’ with nothing in it! The index generator, the monadic use of 1, 
unlike most of the other primitive functions in APL, cannot operate on an 
array; e.g.,14, where 4 is an array, produces the result RAVK ERROR, 


Conditional Branches 


If Wis odd, .{W produces 1, so line [1] produces an array of length 1 
with a 1 in it, and multiplying this by ODD produces ODD. The first part of 
the statement says: Branch to the line labelled ODD. But what if W is even? 
Then 12|¥ gives an array with nothing in it, and ODD multiplied by that 
produces an array with nothing in it still. APL interprets +(an array with 
nothing in it) as ‘‘ignore this silly statement and continue to the next line.”’ 
So we have in line 2 a conditional branch statement; go to the line labelled 
ODD if Nis odd, otherwise continue. So let us begin and put down the 
complete function. If you make a mistake typing any line, remember how 
to correct it: backspace to the beginning of the error and hit the linefeed or 
ATTN key, depending upon the type of terminal and computer system that 
you are using. On most CRT screens you just backspace and reenter the 
line. 


V MeMD X 
[1] Wo xX 
[2] XUP*+XL AX] 
[3] +ODDx12|N <if N is even, calculates M 
. for even case, otherwise goes 
EVES CO eee to line ODD 


C5] M<(+/XUP[K,K-1])+2 


(6] +0 <tells computer to leave function 


Exit from Function 


Correcting 
a Defined Function 
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[7] ODD:MeXUP[1+(N-1)22] < if N is odd, calculates y 
for odd case and then 
[8] : 
leaves function 


Hold it! 

Before typing the final V and completing the function definition, check 
your entries very carefully. If a line has an error in it, then enter its line 
number (in brackets), type the correct version of the line, and hit RE- 
TURN. The computer will respond with the line numbers following the one 
you entered. If there are no more corrections, type in V and RETURN; 
otherwise repeat the process. 

The first and third lines we have figured out. The three lines [2], [4], 
and [5] are copied from page 40, where we wrote down the routine for 
calculating the median M when N is even. But line 6 is a puzzle. Notice 
that the header of the function has no line number, and that line numbers 
start with 1. The function >0, or-(any line number not in function) tells the 
computer to ‘‘exit’’ from the function and go back to the point from which 
our function, MD, was called. In short, when the computer hits line 6 it 
knows it has finished calculating the median. It is best to stick to ‘‘>0’’ to 
exit from the function, since you may later want to add new lines to the 
function. The last two lines are modified from page 40, where we showed 
how to calculate the median where WV is odd. 

Let us try out our new median function. We put into the computer the x 
and W arrays we used earlier: 


Xe12°3 7°45 °6°7 
WH6 1061512015 


and now we can check the function out. 


MD x 


MD W 


Correcting a Defined Function 


What if you enter MD X and you donot get 3? You’ve made a mistake! 
Let’s see how to correct it. First, relist the variables X and W and make sure 
that they are correct. If they are, continue; if not, redefine X or W and retry 
the function. 

But suppose you are sure the mistake is in the function MD? In order to 
check it, you will need to display it. To do this, we type in: 
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Display, U 


)ERASE Command 


Removing a 
Suspended 
Function 


)SI Command 
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V MD[OI] V < this line ‘‘opens’’ the function, 
displays it (that’s what [0] does), 
and closes it. LD) is typed in by 

[1] W+p X upper. case L. 


V M«MD X 


[2] XUP+XTAX] 


[3] >ODDx12|N — 
< this is the computer’s response 


eee ee to your command to display the 
C5] Me (+/XUPLK,K+1])+2 function 
[6] +0 


[7] ODD:MeXUPC1+(N-1)+2] 
V 


Suppose you discover that you made some typing errors in the function. 
In Chapter 5 we will show you in detail how to correct errors, but for now 
we will give you one drastic, but surefire, way to do it. If we remove the 
function altogether, we can completely redo it—this time very carefully. 
To remove the function, type 


)ERASE MD 


SI DAMAGE < this statement is the computer’s 
response. It warns you that you 
have erased a suspended function. 
SI stands for State Indicator. 


Suspended Functions 


You might also get a statement indicating there is a syntax, value, or 
some other error in a line of your function when you try to execute the 
function with an error in it. For example, 


MD X 
VALUE ERROR < this is the computer’s response 
pane telling you the first line and 
4d x first position of your first 
A mistake 


The result of this error is to prevent further execution of the function. It is 
said to be suspended. At this time, it is best to remove the suspension. This 
is easy—type 


6699 


> < this is upper case ‘‘+ 
In order to check whether you have cleared the suspension, type 


SI 


Summary 
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If you get no apparent response (actually a blank line) all is well. But if you 
get something like 


MD[3]* 


type in the > instruction again. Keep doing this until you get a blank 
response to the system command )ST. 

Now erase the incorrect function and start again. Remember, you can- 
not correct the function simply by redefining it unless you first erase the 
old function. (This is not strictly correct—you could do it if you gave the 
correct function a new name (i.e., one that differs from the old name).) 

One last word: if you think that the MD function is a lot of fuss about a 
simple operation, you’re right. Later on we will show you how to calculate 
the median in one short line. But in writing out the routine MD you learned a 
lot about APL that will be very useful in the following chapters, and that, 
of course, is what this book is all about. 


Sample Median, M (of an array X): that value such that half the 
observations in X are less than or equal to M and half are greater or equal 
to it. 


Residue function, |: in its dyadic form it is written A|B. For posi- 
tive values of A and B, the A residue of B is the remainder of dividing B 
by A. 

The monadic use of | yields the absolute value. 

Indexing, [7] (of an array X): elements of an array X can be 
selected by specifying their position in the array; the positions are labelled 
from 1 to pX starting from the left. X[T] gives the value of the number in 
the array X in the /th position from the left. 


Grade up, A (upper shift ”, backspace, upper shift #) (of an array 
X): gives an array, the first element of which is the index position of the 
smallest number in X, the second element is the index of the next largest, 
and so on. 


Grade down, ¥Y (upper shift M, backspace, upper shift G) (of an 
array X): gives an array, the first element of which is the index position of 
the largest number in X, the second element is the index position of the 
next smallest, and so on. 


ce 99 


Catenate, ‘‘,’’ (the comma): forms one array out of its arguments; 
A,B is an array with the elements of A listed first where A or B can be any 
one-dimensional array or scalar. 

Roll, ? (upper case @): ?¥, where ¥ is an integer that generates a 
random integer from 1 to WV with probability 1/N. 

Index Generator, 1 (upper shift 7): monadic function, 17, ¥ an 
integer, produces the ‘‘index”’ array 1, 2,3,...,N. 
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Quote, ' (upper shift X): all keyed entries between a pair of 
quotes are treated as characters and not executed as APL functions. 

Del, V (upper shift G): the instruction that is used when entering 
or exiting from function definition mode. 

Niladic function: a function which has no arguments; it carries 
out a specified procedure when called. 

Function Headers: first line of a function that gives the function 
name; specifies whether function produces an explicit result or not, and 
contains either zero, one, or two arguments. Thus: 


Vv AT niladic (no argument) function, no 
explicit result produced 

VAGOB dyadic function, no explicit result 
produced 

V R«FNC niladic function, explicit result 
produced 

V R<FNC X monadic function, explicit result 
produced 


(All six combinations are possible) 


In function definition mode, a line in a function can be corrected by 
typing in the line number and the correct version of the whole line. For 
example, in correcting line [1] at line [3], you would type:[3] [1](cor- 
rect APL statement). 


Unconditional Branch, >: used within a function to alter execu- 
tion path to a different line. Thus: 


[3] +6 
or 
[3] +LABEL 
[7] LABEL: (an APL expression) 


Line Label: aname for aline in a function. When the second of the 
above versions of line 3 is executed, the next line to be executed will be 
number 7 with the label ZABEL rather than number 4. 


Conditional Branch: branching to a function line on the basis of 
the *‘condition’’ of some APL expression. Example: 


[3] »?TRUEx12|N 
[4] 'THE CONDITION IS FALSE! 


[6] TRUE: (another APL expression) 
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If 2|¥” takes the value 1, the next executable statement is line [6 ] labelled 
TRUE. If 2|N takes the value 0, next executable statement is [4]. 


Quad, LU) (upper shift Z): when used as illustrated below with the 
function MD it displays the function uD: 


Vv MD[OI V 


)CLEAR: asystem command used to remove all existing functions 
and variables from the existing workspace. The workspace can be thought 
of as the part of the computer which is assigned to the user for his compu- 
tation procedures (both stored data and functions). 


)ERASE : (FUNCTION NAME): a system command that enables you to 
erase a function from your workspace. 


)SI: system command called state indicator. If you have func- 
tions which cannot complete their operation, they are said to be sus- 
pended. The command )STJ will indicate which functions are suspended 
and at what line in the function the suspension occurs. 

To clear a suspended function, type > as often as needed to get a blank 
response from the command )SI. 


APL Practice 


1. Let’s examine in some detail the use of the new APL functions pre- 
sented in this chapter. For X+ 5+111, compare the results of the oper- 
ations in each question. 


(a) 11x (j) ¥C12]<0,0 and then ask for X 
(b) 0|X (k) Xf1 1 2 3] 

(c) 2|X (1) xX014) 

(d) ~2|Xx (m) XLY¥X] and compare to X[1 pX] 
(e) 10]X (n) xLO] 

(f) x\0 (0) xf12] 

(g) xX] 10 (p) P+.4and then x[P] 

(h) X013) (q) X|X 

(i) XL2,3] 


2. Since you know that the meaning of 76 6 and 6 6 is equivalent to 206, 


why not combine them into 72206? Try: 
(a) ?3 9 6 
(b) ?3 p 0 
(c) 23 p 1 
(d) 21 9 1 
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(e) What is the difference between 376 and (a)? 
(f) What is the difference between pX,p Y,(oX),(o Y) and pX,Y for any 
arrays X and Y? 
Indexing can be used with the variables defined as a list of characters. 
Consider the following code game. Let 
AL<«'I WILL MEET YOU TOMORROW AT SEVEN! 


(a) AL[8 26 22 26] tells whom you will meet 

(b) ALL 22,9,29,17,26,15,22,26,33,17] tells where 

(c) ALL[10,26, 27] says what you will do 

(d) ALL 29,30, 31,32,33]+'SIX' changes the time of the meeting 


Let Z«"5+111 

(a) ((601),(500))/Z: picks the six first elements of Z, and is equiva- 
lent to Z[16]. 

(b) Show how to pick the last four elements of Z. 

(c) For any array Z write an APL expression that will pick the middlen 
elements. What if o2Z is even? 

(d) Is ZL YYZ] equal to Z[ 4421? 

(e) Suppose you want to delete the seventh element of Z. How would 
you ask the computer? 

Can you discover, with the computer’s help, which of the following 

series converge to a limit and which diverge? If the series converges, 

can you determine what the limit is? How accurate is your answer? 

Compare your result with a table of limits of series. 


2 3 4 
Osa ra as ee 


(fet tet tet tetas: 


(1+ 4) +(1+5) + (145) Fe 
os 
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6. 


2 4 


e! e e 

Tere pay Se ae 
_ log] | log 2 log 3 

@) 1+ log 1 + FF log 2 3+log3a 


Let the list M <— 3 3.2 3.5 3.6 3.6 3.7 4 4.2 4.5 5.1 5.3 5.5 represent 
the annual U.S. money supply in hundred billions of dollars for a 12 
year period. Define a new list consisting of the first differences of the 
elements of M. That is, for i = 2, 3, ..., 12, let DM; =M,- 


M,_, = 3.2 -— 3, 3.5 — 3.2, 3.6 — 3.5, . . . , eleven terms in total. 


What are the mean and variance? How about the list (M@; — M,_,)/ 
M,_,,i = 2,3, .. . , 12? The list log M;_,/log Mj, i=2,... , 12? 
(Hint: One suggestion for finding the first differences of a list is: 
V DM<DEF X3;M13M2 
[1] Mi+« (0,((pM)-1)p1)M 
[2] M2+ ((((pM)-1)p1)p,0)p/M 
[3] DM+M2-M1 
C4] Vv 


Notice that DMi<DEF M produces the first differences of mM. 
DM2+DEF DM1 would produce the second differences of MW, 
DM3+DEF DM2 would produce the third differences of Y, and so on. 
What are the mean values of the differences D/¥1, DM2, DM3 . . .?) 


Here are some important inequalities and identities. 
(a) The Cauchy-Schwarz inequality 


(DL ixyi) = 2h i?)(2hiy7) 
(b) Hélder’s inequality 
(DLixiyi) S (Shih) (aL ys) 
for any p and q such that 1/p + 1/qg=1, p> 1. 
P q P 
(c) Minkowski’s inequality 
[2h i(x; + y)XI% Ss Coa) + Coy" 
for xy; = Oand K > 1 


(d) For >“,a; and =/_,b, convergent series of positive numbers such 
that 2a; =-2b;, we have 


LiLiajlog(b;/a;) = 0 
(e) The Lagrange identity 
(2h1x7) (Diy?) — (SExy)? = Vey; — xsi)? 


In some advanced statistics courses, these inequalities and identities 
are proven by algebraic methods. However, here is a way to use the 
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computer to be fairly certain that the result can be proven algebra- 
ically. 

Pick four integers at random (Hint: use ?); say you get 49 10 15. For 
each integer pick a random sample of numbers for list x and another for 
a list y, where the number of elements in each list is in turn 49 10 15. 
Check to see that the suggested inequalities hold with each sample. 
Repeat the same experiment using different random lists x and y each 
time. 

If you discover no reversal of the inequalities (or violation of the 
identity), you may reasonably suspect that the result can be proven 
algebraically, and hence that it is a mathematically correct statement. 

The above procedure is known as a ‘‘sampling experiment,’’ and 
provides the basis for what are known as ‘‘Monte Carlo’’ procedures. 
Monte Carlo procedures often are used to solve mathematical prob- 
lems that are difficult to solve by analysis. 


Enter the function MD X (page 48) into your workspace. 

(a) Display the function. 

(b) Describe what the function instructs the computer to do at line [3]. 

(c) Add the line ‘The Median of X is’ between lines [5] and [6]. 

(d) Execute the function and press the ATTN key before you get any 
answer (i.e., while the computer is in execution mode). Press the 
command )ST7. 

(e) Get out of the suspension. 

(f) What would the computer do if you changed line number [6] of the 
original function into >1 and the number of elements in X is even? 


Statistical Exercises 


1. 


Let C<_"1 + 7100 p 2 represent the outcome from tossing a coin 100 
times. The random variable C takes on the value 0 if a ‘“‘head”’ and | if 
a ‘‘tail.’’ If a head occurs you win one dollar, while if a tail occurs you 
lose one dollar. Calculate how many dollars you expect to win or lose 
if you play this game with the computer. Try it four times. What would 
you pay to play this game? 


The standard deviation of the binomial distribution with parameters n 
and p is given by the formula S = V/np(1 — p), where n is the sample 
size and p is the probability of success in a single trial. For 2 — 100 
and p <— (10) + 10 find the value of p for which S is maximum. Is 
there a minimum? 


Your friend from Texas called you up and asked you to find out what 
the average price is of used cars sold in N.Y. You collect a small 
sample of New York used car prices as shown in the following table. 
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P is the price of the car in thousands of dollars rounded to the nearest 
0.5 thousand, and F is the frequency, i.e., how many cars you ob- 
served at each respective price. You have a sample size of 102. Calcu- 
late the sample mean, variance, and standard deviation. Plot the his- 
togram of P vs. F. How would you convey to your friend, who does 
not know any statistical theory, the information you have collected? 
How would you advise her? 


4. One hundred cans of floor wax, randomly selected from a large pro- 
duction lot, have the following net weight in ounces with the corre- 
sponding frequencies: 
weight | 19 19.2 19.4 196 19.8 20 20.2 20.4 20.6 20.8 21 
frequency | 0 3 8 9 16 2 15 12 4 5 6 


(a) Find the sample mean weight (x). 

(b) Find the sample median (Mx) and compare its value to the mean. 
Explain the difference. 

(c) What is the value of the sample standard deviation S$? 

(d) What percentage of the sample falls into each of the following 
intervals: 


1. 


3.x+3S8 


(e) The variation in production by net weight is ‘‘acceptable’’ when 
the mean value is 20 ounces and 95% of the cans produced contain 
no more than 20.3 nor less than 19.7 ounces. Can we conclude that 
the production run is acceptable on the basis of this sample? 


5. The formula P(y;) = (0.1) (y + 1) gives the probability of y = 0, 1, 2,3 

occurring. 

(a) Graph P(y;). 

(b) Verify that P(y) is a probability distribution function, that is, 
that =7_, P(y;) = 1. 

(c) Calculate the cumulative probability distribution function of y and 
graph it. 

(d) Find the expected value of y. 

(e) What is the value of P(y < 3). That is, what is the probability of y 
being strictly smaller than 3? 

(f) What is the probability of y being smaller than 0? 

(g) What is the probability of y being smaller than 10? 

(h) Calculate the standard deviation of y. 

(i) Calculate the third and fourth moments about the mean. 


‘ 
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6. 


Suppose u is a random variable that assumes the integer values from 
one to ten, each with equal probability of 0.1. Use the computer to do 
the following: 


(a) Pick 20 random samples, each of size 30, from the distribution of u. 


(b) Calculate the mean value and sample standard deviation of u for 
each of the 20 samples. 


(c) Print the vector of mean values and call it UV. 


(d) Calculate and print the sample standard deviation of the 20 mean 
values. (Call it SY). 

(e) Calculate and print the percentage of the samples whose means fall 
into the intervals 


1. 5.5 + 18u 
2. 5.5 + 2Su 
3. 5.5 + 3S8u 
where 5.5 is the mean of the distribution w. 
(f) Calculate and print the percentage of the samples for which the 
intervals 
lu, + 
2. u;, + 2Su; 
3. a + 3Su; i=1,2,...,20 
include the number 5.5. 
What is the connection between the results in (e) and (f)? 


Ravel, , 


Some More Statistics 


The major objectives of this chapter are to enable you to calculate a few 
more simple statistics and probabilities, practice your computer skills, and 
learn some more useful APL functions. 


Ravel 


First, let us clear up a small difficulty with the routine we used to calcu- 
late the arithmetic mean (see page 20). We calculated WV, the number of 
observations, by V¥+oX where X is the array of numbers whose mean we 
wanted. What if X is a scalar, not an array? In this special case, we would 
like the mean of X to be X; but we will not get that result by using V«pX, 
since oX is ‘‘blank.’’ There is an easy way out: our friend the comma (,), 
used this time as a monadic function, is called ravel. Ravel simply makes its 
argument an array, whatever its shape was to start with. Thus if Xis scalar, 
then ,X is an array of length one. Try the following: 

X*3 
(+/X)+# pX 


Nothing seems to have happened. There is no response’since division by 
‘‘blank’’ is undefined. Now try 


(+/X)+0,X 
3 


Now we have the correct result. 
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5.1 Some Basic Statistics 


APL Routine to Let’s consider a routine that will calculate a number of basic statistics, say 
Compute the mean, median, variance, standard deviation, the range, and so on. To 
Mean, Median, increase our understanding of how to define and use functions, let us define 
Variance, Standard a function which has a single argument which is an array X, does the 
Deviation required calculations, and prints out the results, but does not yield an 


explicit result.* That is, the routine performs a series of operations but 


does 


[18] 
[19] 
[20] 
£21] 


[22] 


[23] 


* This 


not return a value that can be used in other calculations. Type in 
VDSTAT X 

Re (MAX*X[ p X))-MIN© (X<X04X])(1] 
SD+ (VAR+ (+/ (X-MEAN+ (4+/X)#N)*2)+ (NH p X)-1)*0.5 
MD<( +/|X-MEAN)+N 
MED+*O.5x+/XC([N#2),1+LNi2] 
"SAMPLE SIZE' 
N 
'MAXIMUM' 
MAX 
"MINIMUM! 

MIN 

"RANGE! 

R 

"MEAN! 

MEAN 

"VARITANCE* 

VAR 

"STANDARD DEVIATION' 

SD 

"MEAN DEVIATION! 

MD 

"MEDIAN! 

MED 


routine is a modified version of one appearing in Smillie (1969, p. 16). 
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Hold it. Don’t type the final V until you are sure that you have typed 
everything just as it’s written above! If you made a mistake in any line, 
correct it as you did in Chapter 4. When you are sure that the function is 


correct, close it out. 


[23] V 


Let’s see if the function works. Type in our old friends the X¥ and W 


arrays. 


Xe12°3 457 
We9 103812015 


DSTAT X 
SAMPLE SIZE 
7 
MAXIMUM 
5 
MINIMUM 
“7 
RANGE 
12 
MEAN 
“1.7143 
VARIANCE 
19.905 
STANDARD DEVIATION 
4.4615 
MEAN DEVIATION 
3.7551 
MEDIAN 
“3 
x 
12° 3 45° 6°77 
DSTAT W 
SAMPLE SIZE 


8 
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) ERASE 


Correcting a 
Function Line 
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MAXIMUM 
12 
MINIMUM 
0 

RANGE 


VARIANCE 
19.429 
STANDARD DEVIATION 
4.4078 
MEAN DEVIATION 
3.75 
MEDIAN 
6.5 

W 
910381201 5 


Correcting a Function Line 


It seems as if our function works, but what if yours did not? 

If your function did not produce the same results as ours, but otherwise 
appeared to work, then first of all check that the variables ¥ and W are 
exactly the same as ours. If they are, but you still got some strange results 
or statements about errors in the routine, then you know that you have 
made a mistake in defining the function. 

You know that you can always use the system command 


)ERASE DSTAT 


to remove the function. But there is a better way! 

First, in case your function is suspended, remove the suspension in the 
way that we showed you in the last chapter. Then check with the )SI 
system command to ensure that the suspension is completely removed. 

The next thing to do is to see how we can correct a line in our function. 
Let’s suppose that you have discovered your mistake lies in line [3]. You 
want to correct just that line. Here’s how to do it. Type in 
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Name of Function 


V DSTAT[T3] {enter the correct APL expression} V 


Indicates Statement Number Closes the 
Function Definition of Line to Correct Function Definition 
Mode Mode 


and then hit the RETURN or EXECUTE key. For example: 
V DSTATL3] MD+(+/|X-MEAN)+NV 


If you have several lines to correct you can use the above procedure 
repeatedly. 

Now that you have a working function, let’s see how it works. Consider 
the first line: 


C1] Re (MAX+X[pX])-MIN<( X+XCAX]) [1] 
Starting from the right, the first group of symbols needing our attention is 
(X«X( 4X) 1) 


Let’s skip the [1] for a moment. First, we sort the array X, and then we 
replace the old X array by the new sorted array X. Thus (X<X[4X]) results 
in an array. X[ 4X] reorders the elements of xX in ascending order, using the 
grade up function 4, and the reordered elements are replaced in the X 
array. In short, X is now an array containing the original elements of X 
rearranged in ascending order. Now the [1] in 


(X«X4X]) (1) 


gives us the first and, therefore, minimum element of X. This result is 
promptly placed into a variable called MIN by 


MIN+(X+X(AX])01] 


The next function, —, instructs us to subtract MIN from the next argu- 
ment, (MAX<XCpX]), and to put the result in R. Note that (proceeding from 
right to left) after MIN has been calculated, X is in ascending order. The 
maximum of X, which is its last element, is therefore given very simply by 
X(oX]. We now have the ‘‘range’’ of the array, i.e., the difference between 
the maximum and minimum of X. 

After line [1], line [2] should be easy. Reading from the right, after )+0.5 
we have: 


(N+pX)-1 stores number of observations in N and 
computes (N — 1) 

(+/(X-MEAN+(+/X)+N)*2)computes 2x; — x)? 

(X-MEAN+(+/X)+=N) computes the array X¥-MEAN 


Finally, raising the whole expression to the 0.5 power (computing the 
square root) yields the standard deviation. In tracing this through, re- 
member to match up the parentheses in pairs. For example, in the first term 
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Absolute Value 
Function, | 


Floor, | Ceiling, [ 
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above, one does not have (N — 1)! as you might at first think, because 


(N+ oX)-1)*0.5 


involves right and left parentheses that are not matched until you reach the 
parenthesis to the left of VAR. 


Absolute Value, Floor, and Ceiling 


Line [3] involves the monadic use of |, which produces the absolute 
value of a number (discussed in the previous chapter). Try keying in 


3.2 
|-2.6 
2.6 
XABS+|X 
XABS 
1234567 


Line [4] shows the quick way to get the median and introduces two 
new, but related, functions: Floor | and ceiling |, both used here as 
monadic functions. Floor | rounds its argument down to the next lowest 
integer, whereas ceiling [ rounds up to the next highest integer. Floor | is 
keyed in as upper shift D; ceiling as upper shift S. For example, 


be Dis Debt 


ape eencee IA ear 

o°63 

If V is even, say 6, then the index function 
XE ( fwe2),1+ L we2) 


yields the third and fourth elements of X, which was reordered into ascend- 
ing order in line [1]. If Wis odd, say 7, then 


XC(P +2) 14+ lL N2] 


produces the fourth (middle) element of X¥ twice. Plus reduction followed 
by multiplication by 0.5 gives the correct answer in both cases. 

Line [4] is clearly a much better way of calculating the median than is 
the procedure we used earlier. Since that approach has now served its 


5.2 Dummy, Local, 
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purpose, you should use line [4] or something like it for calculating a 
median. But the experience provided an important lesson: the first way you 
think of doing something is probably not the most efficient way—so you 
might think of some alternatives before proceeding with your attempt. 


and Global Variables 


Global Variables 


Dummy Variables 


You may have wondered why we printed out the arrays X and W after 
using the function DSTAT in §5.1. However, if you think about what was 
done in line [1] in DSTAT, you might instead wonder why it is that although 
we rearranged the arrays X and W into ascending order in line [1], they 
were in their original order when we printed them out. The explanation for 
this result is that the variable X, which occurs in both the header and the 
body of the function, is ‘‘local’’ to the function. So anything that happens 
to an X inside the function does not affect any X that exists outside the 
function. This idea seems intriguing, so let’s pursue it. 

Up until now, you have become accustomed to the idea that if a variable 
is redefined in some function, then the old definition is lost and only the 
new one prevails. For example, 


N+6 
N 
6 
N+(N+1)*2 
N 
49 


and so on. As we shall see, the variables we have been using so far are 
known as ‘‘global variables’ or, rather, the way in which we have defined 
and used these variables makes them global variables. The values rep- 
resented by the variable names which are globally defined can be called 
and used by any APL expression at any time. 


Dummy Variables 


The variables that are used in a function header as the arguments of the 
function are known as dummy variables; these variables are in fact defined 
only within the function itself and nowhere else! For example, if you start 
with a clear workspace, we can illustrate this idea with an experiment. 


Type 


)ERASE A B This is to ensure that A, Bare not defined 


as variables anywhere in your workspace. 
VC<A DUM B y y ocr 
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L1} C+A+1+B+3 V This sets up a function with A, B 


% as arguments in the function. 


VABUE ERROR 


A This indicates that after defining the 
function DUM, both Aand B are still 


i undefined in the computer workspace, 


B 
VALUE ERROR 

B 

A 


A DUM B We get a VALUE ERROR because A, B 

are undefined outside the function DUM, so 
that we cannot use undefined variables 

A DUM B as arguments for the function. 


VALUE ERROR 


A 
Suppose we assign zero to both 4 and B, thereby defining 


A+B+0 Here we use the function and it 
ABS produces the value 4. In this case 
the result is completely independent of 
m whatever we have defined 4 or B to be 
outside the function. 
a So, even after execution, A, B 
0 still have the values assigned 
to them initially. Although the 
B function produced the expected 
0 result of 4, A and B have not been 
changed to 4 and 3, respectively. 


We see from this experiment that the variables used as arguments in the 
definition of a function are distinct from global variables. When A and 8 
were set to 0 by use of the replacement statement, they were defined 
globally. The variables in the definition of a function serve a role distinct 
from that of variables generally. 

Consider the mathematical statement defining the summation of a set of 
n numbers (a,, do, ..., Gy): 


N 
1% 


In this definition, the index 7 is a dummy variable also—we could use any 
letter and get the same result. Thus 


La, = La; = La, 


and so on. Correspondingly, in our function DUM, the variables 4 and B used 
in the definition served the role of dummy variables. They merely indicate 
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each role that the first and second arguments play in the function. If we 
substituted any other two variable names in the function definition we 
would get the same results on execution. But when we come to the execu- 
tion of a function, globally defined variables must be used in calling the 
function; otherwise a value error will be generated. For a function to 
perform its assigned task it must have some values on which to operate. 
Assigning values to variable names by the assignment operator <— does 
just that. 

The above example of summation indices gives us an idea why we might 
want to distinguish global and dummy variables. First of all, we see that we 
will want our function definitions to be general in the sense that we would 
like to define the function once and then be able to use it with any variables 
we choose. When we define the function we will have to use some symbols 
to represent the arguments. Consequently, you can see the benefits of 
deciding that variable names specified in function definitions should have 
no meaning outside the definition of the function, and that to use the 
function global variables must be specified. 

When you become more expert at computing you will discover that you 
will have to handle a large number of variables, and that it is very easy to 
forget what is what. A part of the problem is that it is very easy to give the 
same variable name to two different variables—and you already have dis- 
covered that the second assignment replaces the first. In any case, you can 
appreciate how confusing it all can be. Consequently, it is a relief to know 
that when you define a function and the use of its arguments, you will not 
affect the values of any global variables that you have already defined and 
want to keep. 

What about variables that appear in the body of a function, but not in the 
header? These are also global variables and they will be defined when the 
function is executed, but not before. For only when the function is executed 
are the operations defined in the function actually carried out. Before then 
the operations have been defined, but not yet used. 


Local Variables 


The idea of variables that appear as arguments in the header of the 
function, so that they are defined only in the context of the definition of the 
function, can be extended to ‘‘local variables.’’ Local variables are defined 
in the header by separating the specification of the local variables from the 
function header (and from each other) by semicolons. For example, con- 
sider the following headers: 


VR<A GET B3X3Y 
VA BY B:sxX:Y 
VY FUNC3X 


X is alocal variable in each of the above functions and Y is a local variable 
in the first two. Note that local variables can be defined with an explicit 
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functional result (function GET), without an explicit functional result (func- 
tion BY), and even for functions without arguments, as in FUNC. 

Local variables differ in several respects from dummy argument vari- 
ables. When a function is used, that is, when the computer is instructed to 
perform the mathematical operations specified by the function definition, 
then global variables must be inserted where the dummy variables 
appear—a one-argument function needs one global variable to operate on; 
a two-argument function needs two global variables to work on. But the 
local variables, which are specified in the header, are defined in the body of 
the function and do not have to (indeed, must not) be supplied by you when 
using the function. For example, one would call the above functions by 


P GET Q, or 
S BY T, or 
FUNC, 


where P, Q, S, Tare global variables defined elsewhere in the computer 
before these functions are executed. 
Consider a use of a local variable: 


X41 2°3 45 6°97 
V FUNC;X 

C1] x+2 

C2] x Vv 
FUNC 


x 
De Gy NG, eee 

We see from this example that the globally defined X and the X defined 
locally to the function FUNC are two entirely different variables. The 
specification of local variables lets you extend the advantages of the 
dummy variables to more variables. 

Dummy variables that appear as arguments in the header of a function 
when defining it enable one to define the mathematical operations to be 
performed. Use of the function requires substituting previously defined 
global variables. 

Local variables enable us to reuse variable names within a function 
without affecting whatever definitions these variable names may have 
elsewhere. 
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Ravel, *‘,’: a monadic function that makes its argument a one- 
dimensional array. 
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DSTAT: aroutine in this book defined to calculate maximum and 
minimum values, range, mean, variance, standard deviation, mean devia- 
tion, and median. 

A statement, say the third, inside a function routine can be corrected by: 


V DSTAT[3] {correct APL expression} 7 
Absolute value, | (upper shift 4): a monadic function that gives 


the absolute value of a number. 


Floor, L (upper shift): rounds number down to the next lowest 
integer. 


Ceiling, [ (upper shift): rounds a number up to the next highest 
integer. 


Dummy Variable: variable names used in the definition of a func- 
tion’s arguments, are not defined outside the function. 


Global Variables: variables defined by the assignment operator, 
can be used by any APL expression or routine. 


Local Variables: variables defined only within a function, are 
designated in the header. 


EXAMPLE: 

A3 

V AX FNCT ¥32Z1322323 
[1] B+ some APL expression 


[2] R« some APL expression 


[3] 
.. V 
Result Variable: 4 
Dummy Variables: X,Y 
Local Variables: Z1, 22, 23 
Global Variables: A,B 


APL Practice 
1. Let X<(€?10 p 10)+2 
Drill 
(a) 4 px 
(b) ( pX) pX which is of course _X 


(c) 30 9 'X' andtry 3+30 p 'X'. (You get a Domain Error because x 
is an array of characters, not numbers.) 


(d) p(pX)oX 
(e) Is [3.4 the same as | 3.4+1? 
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(f) lL x 

(g) Is [X the same as | X+1? 

(h) Isl X the same as LX? 

(i) Is[[X the same as [X? 

(j) What do you get by using | /X; or[/X? Compare L\X and! \X. 
(k) Is X-1|X the same as LX? 

(l) Is X+(1-L |X) the same as [X? What if some X; are integers? 
(m) Is ([/X) the same as (X[AY])[pX]? 

(n) Find the two largest values of X. 

(o) Find the third smallest value of X. 
Let W+(~5+19)#2, YY+ifW, and Z+«|W, ie.,Z is the array of the 
absolute values of W. 

(a) Compare [W to [Z and [ YY. 
(b) Compare |W toLZ and LYY. 

(c) Compare Z to (Wx2)x*.5. 
(d) Compare ~-2 to -(Wx10)*.1. 
Consider the polynomial f(X) = 10X — X?, where X¥+ 51+1101, ie., X 


takes only the integer values from ~50 to 50. For these values of X, f(X) 
might be either positive or negative. 


(a) Find the maximum value of f(X). 
(b) Find the values of X for which f(X) is positive. 
(c) Display all the negative values of f(X). 


Using X+(~51+1101)+10 and f(X) = —X? + 3X? 4+ 1, find the two 
local maxima of f(X) as well as the one local minimum. 


For any array V verify that the following APL expressions are equiva- 
lent. 


(a) (+/V)= pV 

(b) +/V: pV 

(c) V+. pV 

(d) 1+.xV=V+.*0 

Put the following function into your workspace: 
V J*JOHN T3A3B3C0;3D 

[i] Asaixt 

[2] B+2xI 

[3] C+3xI 

Pu] Deux 

[5] J+A+B+C+D 


[6] V 
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(a) What does the function do? 
(b) Which variables are local? 
(c) Which variables are global? 
*(d) Which variables are dummy? 
(e) Which variable contains the result? 
(f) Is the function monadic, niladic or dyadic? 
(g) Does the function give you an explicit result? 
7. Consider the polynomial f(X) = —1-X° + 2X! + 3X? + 5X? — 10X14 + 
2X*°, Let B+" 1 2 3.5 10 2 be the array of coefficients and P+15 the 


array of the exponents of ¥. Then &+.xX*P evaluates the polyno- 
mial for X = K. 


(a) Evaluate the polynomial for K = —5 and K = 3. 
(b) Does this polynomial have a root for —10 < X < 10? 
8. Refer to exercise 7. Suppose you discover that there is one root be- 


tween 0 and 1. Write a program that will find the root to the nearest 
thousandth. 


Statistical Applications 

1. Use your DSTAT X function and the samples 
We1 234567 8 9 10 114 
2-1 245567 7 8 10 11 
Yee Dot Oe 40 10 aa aa: Ad 


to verify that 

(a) All three samples have the same number of elements, the same 
maximum, the same minimum, the same range, the same sample 
mean, and the same median. 

(b) The median is equal to the mean for all three samples. 

(c) Z has the smallest mean deviation, standard deviation, and vari- 
ance. 

2. Use the dyadic use of the ceiling and floor functions to play the follow- 
ing game with the computer. Toss a coin 100 times. You win $1 
whenever a head occurs, otherwise you lose $1. You start the game 
with $20. During the game record: 

(a) How many times you were in deficit. 

(b) How many times you were in surplus. 

(c) What your maximum profit was and at which toss it occurred. 

(d) What your maximum loss was and at which toss it occurred. 

(e) How many times you switched from a surplus to a deficit posi- 
tion. 
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Some More Statistics 


Did you go broke by the end of the game? 


A random variable X takes the values 1,2,. . . , 11, each with prob- 
ability of (1/11). Use the scan operator and the random number genera- 
tor to confirm the following two arguments. 

(a) As the sample size gets large the sample mean ‘‘approaches”’ (or 
fluctuates more closely around) the number 6, which is the popula- 
tion mean. 

(b) As the sample size gets bigger the sample variance ‘‘approaches”’ 
(or fluctuates closely around) the number 10.083, which is the 
population variance. 


The numbers of admissions to the emergency ward of a hospital be- 
tween 4 and 8 p.m. for a period of 20 days were 0 1302345053634 
011425. 

(a) Use your DSTA? X function to calculate the maximum, minimum, 
range, mean, variance, standard deviation, mean deviation, and 
median. 

(b) Unexpectedly, on the 21st day there were 11 admissions. In light 
of this information recalculate the measures asked for in (a) and 
find which ones are affected (i.e., increase or decrease), and which 
remain unchanged. 


The following table gives the population growth rates for various re- 
gions of the world for the year 1970: 


Region Population in Millions Annual Growth Rate in % 
Europe 470 0.8 
U.S.S.R. 240 1.1 
N. America 230 1.3 
Oceania 20 2.1 
2100 2.3 
Africa 350 2.6 
S. America 290 2.9 
World 3700 


Source: 1974 American Almanac, Table 1322. 


(a) Find the weighted average of the world population growth rate. 
Which average population (weighted, unweighted, geometric, or 
harmonic) is the most appropriate? 

(b) Use the computer to predict the world population for the year 
2050 using the appropriate estimate for the growth rate. Compare 
this to the estimate obtained by adding the separate estimates for 
each region. Assume that the world growth rate will not change. 


Refer to exercise 1. Verify that in no case is the mean deviation greater 
than the standard deviation. 


Redefine the DSTAT X function as F DSTAT1 X, where F is the rela- 
tive frequency of X. Use this function to solve the following problem. 
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A random variable X is defined as follows: 
—2 with probability 1/3 
X= 3 with probability 1/2 
1 with probability 1/6 
Calculate all measures of location given by the function for 
(a) X 
(b) X? 
(c) 2X + 3 
(d) 2X? + 3X + 1 
(e) Is the variance of 2X + 3 twice the variance of X? 
(f) What is the relationship (if any) between the mean value of X and 
the mean value of X?? 
8. A population consists of six elements with X values 10 11 12 13 14 15. 
One element is picked at random. 
(a) What is the expected value of X? 
(b) What is the expected value of X?? 
(c) Suppose that the elements are circles and the X-values are their 
diameters. What is the expected value of the area of such a circle? 


(d) Suppose that the X values are dollars per day that you carry in 
your pocket during a 6-day period, and that you always spend $4 
plus 80% of whatever is left. What are your expected expenditures 
on a randomly selected day? 

(e) Suppose that the X values are the number of cars per minute that 
pass through six gates at a toll road. Find the expected number of 
cars going through the six gates per hour. 


9. LetX take the values 0, 1, 2, 3, 4, and let Y = X?/30 be its probability 
distribution function. First verify that Y is a probability distribution 
function, i.e., that Y, > 0 and X4Y; = 1. Find E(X) and the variance 
of X. 


Higher Order 
Sample Moments 


72 


6 


Higher 

and Cross Product 
Moments 

and Distributions 


Higher Order Sample Moments 


So far we have restricted our attention to the first two sample moments: 
the arithmetic mean and the sample variance. Let us write a function to 
calculate ther th sample moment about the mean, where r is greater than or 
equal to 2. Mathematically, what we want to calculate is 


Zt — Y'/(N — 1) 
From what we have learned so far, we can write the answer down. Try 


doing so by working outwards from (x; — x) in the mathematical expres- 
sion. Thus, write down on a piece of paper 


First effort: X-MEAN+( +/X)+N 
Second: Me-(+/(X-MEAN+(+/X) +N) *R) 
Third: Me(+/(X-MEAN+(+/X)#N)*R) + (N* p ,X)-1 
Fourth: VMcR MNTS X 
[1] Me(+/(X-MEAN«(+/X)+N)+*R) + (Ne op ,X)-1 
E21 ¥ 


Now that we have it defined, let’s try using it. Try 
3 MNTS X 
22.245 
4 MNTS X 
570.82 
5 MNTS xX 
1477.4 
6 MNTS xX 


20467 


Covariance 
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We have a function that produces an explicit result and has two arguments. 


Covariance 


Another simple function we can write down is one that calculates the 
covariance between two arrays. The mathematical statement is 


Cov (x, y) = 2%; — % (i — Y/N 
= Lixyi/N — xy 


where x, y are the means of the arrays x and y. Try to write down the APL 
expressions on a piece of paper. 


First effort: ((X+.xY)tW)-((4+/X)*(+/Y) ) sN*2 
Second: ((X+.x¥)-((+/X)x(4+/Y) tN) tN& p ,X 
Third: VC+X COV Y 


C1] CHCCX+.xY)-((+/X)x(4+/Y) FN) tN~ 9 XV 
To use this new function, we will need another array, say Y. Type in 
X1 2°73 45 6 7 
Y2 4684 26 
X COV Y 
2.7347 
Y COV X 


"2.7347 


6.1 Some Useful Distributions (Binomial, Poisson) 


Binomial 
Coefficients 


One of the first distributions that you would encounter in your studies of 
statistics would be the binomial distribution, which has associated with it 
the ubiquitous binomial coefficients. 


Binomial Coefficients 


Mathematically, we define the binomial coefficient by 


()=so0n 


forr=0,1,2,...,n,andwheren!=1x2x3x--:+ Xa,r!=1*x 
2x3xX--:+xr,and(n —r)!=1x2x3x--:: X (n —71r). The nota- 
tion n! is called n factorial. As you may recall, the term (7) represents 
the rth term in the expansion of the polynomial (a + b)". The binomial 
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probability distribution is given by 
(‘)p"U — p)"* 


where 0<p<1 is the probability of some event occurring, and 

"p71 — p)""" is the probability of getting r successes in n independent 
trials. (”) is also the number of ways that n objects can be combined r ata 
time. 

In APL, factorial and combinatorial functions are handled very simply. 
We use |, ‘‘shriek,’’ or the exclamation point, which is keyed by upper 
shift X, backspace, period. Thus, !, as a monadic function, produces 1 
factorial by executing !7. Try 


13 
6 

15 
120 
Now try: 

!0 
1 

6 
720 

Hh3 
720 


The combinatorial function that we just discussed is obtained from the 
dyadic use of !, that is, (”) or the number of combinations ofr things takenn 
at a time is given by 2!1/. R must not exceed J, otherwise the result is zero. 
Try. 


315 
10 

O'S 
41 

5!5 
1 


Binomial Distribution 


Using ! we can write a function to yield binomial probabilities as defined 
by ()p’(1 — p)"-"; indeed we can just write the function out. Try the 
following: 


Binomial 
Distribution 


Poisson Distribution 
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VPR<«N BI P 
[1] PR<« (R!N)x(PxR)x (1-P)*N-R<O,1N 
[21 V 


PR js aresult array with n + 1 elements in it, where the rth element is the 
probability of r successes in n independent trials. In B7, Ris computed to 
be an (nm + 1)-element array with elements 0, 1, 2,...,n. Try 


5 BI .5 

0.03125 0.15625 0.3125 0.3125 0.25625 0.031425 
5 BI .2 

0.32768 0.4096 0.2048 0.0512 0.0064 0.00032 
5 BI .8 

0.00032 0.0064 0.0512 0.2048 0.4096 0.32768 


Probabilities are meant to sum to 1. If we have done a reasonable job of 
calculating these probabilities, we should be able to add them up to get 1. 
Let’s try: 


+/5 BI .5 


+/5 BI .2 


Poisson Distribution 


Another important discrete distribution is the Poisson distribution with 
mean value M. The probability distribution is defined mathematically by 
e!M* 

XxX! °? 


X=0,1,2,...ande=2.718... 


An APL function that generates Poisson probabilities is easily written. One 
minor problem is that we cannot write a routine to give ail the probabilities, 
because that would mean an infinite array length; even APL finds that 
difficult! Let us compromise and specify that we want only the first N 
probabilities. This suggests that we define a dyadic function. Consider 


VY PR<N POISSON M 
[1] PR<(*-M)x (MeX)=!X<0,14 
[2] Vv 


X is an(N + 1)-length array, as is PR, which contains the probabilities of 
0,1,2,.. . up toWN successes in an infinite number of trials. 
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For example, 
5 POISSON .5 
0.60653 0.30327 0.075816 0.012636 0.0015795 0.00015795 


One frequently wants the cumulative probabilities, that is, the sum of the 
probabilities from 0to R,0 < R = N, for the binomial distribution, or 0 toR 
for some integer R for the Poisson distribution. Thus, the sum of probabil- 
ities for 0 to R gives the probability of no more than R successes. Corre- 
spondingly, the probability of at least R successes is 1 ~ 2?"'PR;. Let us 
use our binomial probability function to generate an array of cumulative 
binomial probabilities. This involves the use of a primitive function that we 
haven’t used very much, \, scan (typed by striking upper case /); see page 
28 for its definition. The function we require is 


VCB<N CUMBI P 
[1] CB<+\N BI P 
[2] Vv 


Note that we have a function inside a function, which is perfectly okay. 
This is referred to as one function calling another. In fact, a function can 
call itself. That technique is called recursive programming; it is an interest- 
ing subject, and Barron [1968] is a good reference. Returning to cumulative 
probabilities, let’s try 


5 CUMBI .5 
0.03125 0.1875 0.5 0.8125 0.96875 1 
5 CUMBI .2 


0.32768 0.73728 0.94208 0.99328 0.99968 1 


Histograms 


It is a trite saying that a picture is worth a thousand words. That doesn’t 
stop us from repeating it, and noting that in programming it seems that to 
get one simple picture we need to use 10,000 words. This is not so in APL. 
Let us write a simple program for getting a histogram from an array of 
absolute frequencies. Because the number of entries in some cells may be 
very large, it would be useful to have a simple way of scaling the histogram 
down so that our plots do not take up pages and pages of output. For 
example, if we have a set of absolute frequencies whose sum is 100, divid- 
ing each frequency by 10 and rounding off should produce a useful histo- 
gram.* Suppose that we have an array F of absolute frequencies. Consider 


V GS HIST F3M;K 
Ci] MA /F+ | O.5+FS 


* This routine is adapted from K. W. Smillie [1969, p. 20]. 


Dyadic Functions 
Maximum, [ and 
Minimum, | 
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[2] Ge('."),(P/K+<( F2M)/1pF)o' ' 
C3] Glk+1]<'?' 
fu] 4 
[5] +(0<MeM-1)/2 
Lod Cte pres. 
C7] G«10 
bSd¥ 
An example of the use of HIST is given by 
Fe3.8 10 209 74 
S+«2 
S HIST F 


coer m eens 


You will notice that the function starts by plotting a7 in the position of the 
largest frequency first, and then moves down toward the lower frequencies. 
In the first line, the operation | 0.5+F+S divides each frequency by S, 
adds 0.5 and rounds down to the nearest integer. In short, this procedure cor- 
rectly produces the usual rounding-off of numbers to the nearest integer 
after division of F by the scale factor, S. If F/S is 4.2 or 4.5, L0.5+FS 
produces 4 or 5, respectively. The rounded numbers are restored in F. 


Dyadic Functions Maximum and Minimum 


The dyadic operation [/F finds the maximum of the array F and stores 
the result in M. ['/F is equivalent to 


f, ff, ff... 


and the dyadic use of the symbol [, called maximum, is to produce the 
larger number of each pair compared. Thus 


3f5 
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513 


Y 
2468426 


P/Y 
8 


The symbol L, in its dyadic mode, is the minimum function. So line [1] in 
HIST produces an array F of scaled and integer-rounded frequencies and 
stores the largest frequency in M. 


Dyadic Function Reshape 


Line [2] of HIST introduces the dyadic use of p, called reshape. It is 
keyed by striking upper shift 2. The operation VpV will make V into an 
array of length N. If V is too big (too many elements), then only the first 
N elements of V will be used, and if V is too small, the elements of V will 
be repeated in sequence until the newly created array is of length N. Some 
examples of this function are 


20 Y/Y 


9p Nine is greater than the number of elements 
in Y, so the two extra elements came from 
the beginning of Y. 


Hop 3 
3.3 3 3 
4p At Keep this result in mind for the 
AAAA next paragraph. 
1p 2 
2 
0 2 Compare these last three examples 
0 Atl p 2 carefully. 
1 


Line [2] of HIST uses the symbols' ' (upper shift X). The use of a pair of 
quotation marks, as you probably remember, tells the computer to regard 
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whatever is between them as an array of ‘‘literals,’’ that is, characters that 
are not to be executed. Thus we get 


i oer 

3. pe PP 
TTT 

sip ot 
&&E 
and so on. 


Logical Functions 


Logical Functions <, In line [2] of 47ST the left-hand argument to the dyadic function p is 

S,=,2,>,%,A,%,V,  ([/K+(F2M)/10F). The result is an integer, although at the moment it all ap- 

Oy pears to be highly mysterious. The operation ipF, you may recall, pro- 
duces the array of index numbers 1,2,...,0F, where pF is the number of 
frequencies. 


The expression ( F2M) introduces a new type of primitive function, the so- 
called logical functions. Other examples of logical functions are: >, <, <, =, 
#, etc. Essentially, the logical function asks a question, say, Is the relation- 
ship a < b true or false? If true, return a one, if false return a zero. For 


example, 

3<1 
0 

5 < 3 
0 

4 > 4 
a 

2=1+1 
1 

243-1 
0 


You see that the output of a logical function, the result if you like of a 
logical comparison, is either 0 (false) or 1 (true). In our example, F is an 
array and M is the maximum value of the elements in the array F. If F has 
only one maximum, say in the /th position, then (F = M) produces an 
array of the same length as F with 0’s everywhere except in the /th posi- 


Logic Functions 
in Flow 
Systems 
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tion, which contains a 1. That is, (F =-M) looks like 000100. Try, for 
example 


3<21 23945 
000141 

10>14 234 
11141 


A second group of logical functions not only produces results that are 
either 1’s or 0’s but also requires that the arguments of the function be 1’s 
and 0’s. These logical functions are: ‘or’ v, ‘and’ a, and ‘not’ ~. To see 
how these functions work look at the Truth Table for Ff{M 


Truth Table 


The table gets its name from the fact that a statement like, ‘‘the mean of x 
is 22.3 and x has a variance of 2.3’’ is true if and only if both parts of the 
conjunction are true. We could have a long series of « (and’s) and for the 
statement to be true (result in a 1) every element would have to be true (be 
a 1). The statement, ‘‘the mean of x is 22.3 or the variance of x is 22.3,” is 
true if either part is true. We use the symbol ~ (not) to change a 1 to a0, or 
a0Qtoa1. And if vor Aare overstruck with ~ the results are ‘‘negated.”’ 

These logical functions have many uses outside of symbolic logic. For 
example, in modeling material or traffic flow you may have a process that 
can be diagrammed as 


— =F] 
— =} 


Here to stop the process you need to have both F and M stopped. To 
obtain flow, either one needs to be on. If the system looked like 


— =F 


both F and M would have to be on to obtain production or flow and either 
could be off to stop it. You can see the direct relation between these 
situations and our logical functions. After we show you the compress 
function we will show you some numerical applications of these ideas. 


iME—= —— 


Compression / 
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Compression 


We can now consider the dyadic use of /, which is called compression. 
The left argument must be composed of 1’s and 0’s only. Both arguments 
must be arrays of the same length, except that the left-hand side may be a 
scalar. What happens is this. Given an array, say A, of 0’s and 1’s on the 
left-hand side, and an array, say B, of equal length on the right, then A /B 
produces an array whose length is equal to the number of 1’s in A. 
Whenever a; is 0, b; is dropped, and whenever a; is 1, b; is retained. For 
example, 


Aoo0Oto01t004 
B<1 2345678 
A/B 


1/B 
1234567 8 


O/B 


The blank space after 0/B means that the 0 compression of B is an empty 
vector which, when you consider the matter, is natural enough. 

Back to the expression (| /K«(F2M)/1 pF). What is put into K is the 
array of index numbers where F has maximum frequency. [| /x picks out the 
largest of these index numbers. This determines the number of blank 
spaces to be catenated to the symbol '.'. This completes line [2] of AIST. 

Line [3] gives rise to no difficulties. K is an array (even if it has only one 
element), of which each element is an index number of the array F which 
has maximal value. What line [3] does is to put the character T, or the 
literal ‘T’ in each position indicated by the index numbers K + 1, recogniz- 
ing that the first element of G is'.'. Line [4] merely prints G. 

Line [5] decides whether to continue to line [6] or go back to line [2]. 
What this routine does is to start at the top of the histogram and work its 
way down to the bottom, so line [5] first reduces M by 1. If the value of M 
is greater than 0, then 1/2 gives 2 and +2 means go to line | 21. If M reduced 
by 1 is less than 0, then 0/2 gives an empty array. The computer interprets 
such a statement as one to be ignored, so it continues to the next line. 

Every time M is reduced, more frequencies become eligible to be bigger 
than M, so more 7’s will be printed. 

Line [6], reached when M has been reduced to ~1, merely prints a row of 
periods across the bottom of the graph. Line [7] makes G an empty array 
and the routine is ended. The reason for this is that if G were not redefined 
as an empty array, the completion of the function would print the contents 
of G, since an explicit result is specified in the function header. 

You might note that this routine has three dummy variables—G, S, and 
F; two local variables—-V and K; and no global variables. Try the follow- 
ing examples: 
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S10 
Fet 6 28 42 23 
S AIST F 


FeO 0000101 5 10 17 21 20 15 7 30 
5S+10 


S AIST FP 
PITT 
PETE LE 


 ) 


TTTTTT 
TTTTTIT 
LT PELL 
LPTTL ITE 
LITLTL LT 
LEPET TITLED 


Ce 


In some cases you may want to look at the histogram of a range of your 
data, maybe the central portion, or possibly select the observations on a 
series of criteria. We might have a vector JNC containing a sample of 
personal incomes and only want to look at those greater than $20,000 and 
less than or equal to $50,000. The APL statement would be 


((X>20000 )ACXS50000) )/X 


Here we are combining the two types of logical variables and using dyadic 
compression. Each operation in parentheses produces a logical vector with 
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the shape oX. After these three functions are computed, compression 
selects particular elements of X that are both larger than 20,000 and equal 
to or smaller than 50,000. This statement could have had an assignment to 
a new variable F or it could have been written as 


S HIST ((X>20000) a (XS50000))/X 


Another situation arises in presenting statistical results and in computa- 
tion in general; and that is selection of data from a larger data base. For 
example, along with income we may have a second vector coded with years 
of education, a third with degree status. Suppose we coded degree as 0-no, 
1-high school, 2-associate, 3-bachelors, 4-masters, 5-Ph.D., 6-DDS, 7-MD, 
etc. Now suppose you want the income histogram of people with 16 or 
more years of formal schooling or having a 2 or 4 year degree. Suppose the 
data were organized on an individual-by-individual basis: 


INC 
15843 21842 9823 13586 
ED 
15 18 12 13 
DEG 
2 bs) Q 4 cae 


The first subject has an income of $15,843, went to school for 15 years, and 
has an associate of arts degree. The APL statement that selects our sample 
is 


((ED216) V (DEG=2) v (DEG=1))/INC 


You can select a number of complex combinations of attributes for analysis 
using these techniques. 


6.3. The Normal Distribution* 


Normal Distribution 


The most important distribution you will have to handle in statistics is the 
Normal Distribution. The mathematical expressions for the normal density 
function f(x) and the corresponding cumulative distribution function F(x) 
are: 


_ Exp (=1/2( ~ )?/0%) 


f(x) (Q702)'2 
F(x) = [. EXP Ne dx, 


where Exp (-) denotes the exponential function, i.e., Exp(x) = e?. 


* Remember that, as mentioned in the introduction, starred sections involve statistical material 
beyond the level normally presented in an elementary course. No new APL expressions, functions, or 
procedures which will be used later in the book will be introduced in these sections. 
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Plots of the Normal (0 mean, unit variance) Density and Distribution Functions 


Let us define APL functions to give us the normal density and cumula- 
tive distribution values for any value of x and any value for the mean, p, 
and the variance, oc”. 

In defining our functions, let us agree to let the array P have two ele- 
ments, the first of which is the mean and the second the variance. Let the 
second argument be the x value at which we want to evaluate the func- 
tions. Thus, 


PO 1 
X38 


means that we want to evaluate a normal density function with mean 0, 
variance 1, at the point x = 3. We define: 


VD<P NORMD X 
{1] De(*(-0.5x(X-P[1])*2)+P[21)+(OP[ 2] x2)*0.5 
[2] Vv 


The only operation in this function with which you will not be familiar is 
the monadic function ‘‘pi times’’ (0 called large circle). o is keyed as upper 
case 0, 03 produces 7 x 3 or 9.4244777961, where aw = 3.141592654. 
OP[2]x2 produces the mathematical expression (2770), where go? is the 
variance. 

Experimenting, we obtain 


PO 1 
P NORMD 1 
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0.24197 

P NORMD 2 
0.053991 

P NORMD “1 
0.24197 


Now let us get the corresponding cumulative distribution, or at least a 
reasonable approximation to it. 

The integral of the function P NORMD X can be approximated by adding 
the areas of a series of small rectangles which approximate the area under 
the density function. Let the base of each approximating rectangle have a 
width specified by the user, and let the height be determined by the func- 
tion VORMD at the midpoint of the interval. 

Since the normal integral is theoretically defined from —~, to +=, we will 
have to ‘‘approximate’’ the end points. 

Let the two arguments of the cumulative distribution function be X and 
the array /. The first element of / is the mean, the second the variance, the 
third the chosen interval width in terms of standard deviations, and the 
fourth the number of standard deviations below the mean at which integra- 
tion starts, e.g., 


I<0 116 


indicates that the distribution has a mean of 0, a variance of |, an interval 
width of 1 standard deviation, and the integration begins at (0 — 6) = —6 
on the X axis, or 6 standard deviations to the left of the origin. We define 
(in several lines for clarity): 


Vv A«I NORMC X 
[1] LHS<I(1)-I[4]xS+I[2)*0.5 
[2] X«(X-I[1]) +S 
[3] NINT+|(X-LHS)+IC3) 
C4} A<I03]+.xIf1 2] NORMD (( LHS+I[3]x1WINT)-I[3]+2) 
[Ss] Vv 


The first line determines LHS, the lefthand side from which the integra- 
tion is to start (in our example ~6). Line [2] normalizes the variable of 
integration, i.e., subtracts the mean from the point where we are evaluat- 
ing the function and divides the result by the standard deviation. The third 
line defines VINT as the number of intervals into which the integral is to be 
broken up. The last line approximates the area under the normal density. 
The density function is evaluated at the midpoint of each interval.* 

The procedure is illustrated in Figure 6.2. 


* Note that the righthand argument in this function, X, may only be a scalar as the function is 
currently defined, since if X¥ is an array, attempted execution gives a RANK ERROR. 
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Approximating the Normal Distribution Function by Calculating the Area 
under the Normal Density Function 
(a) 


Area Wanted for Answer 


Interval 
Widths 


Approximating 
Rectangles 


Figure 6.2 


Let us check our function against a set of normal tables. Try 
£204 44.5 
X14 
I NORMC xX 
0.15855 
X+0 
I NORMC xX 


X*#2.5 
I NORMC X 


Comparison of 
APL-STAT 
Routine to 
Tabulated Values 


Summary 
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0.99381 
I NORMC 3.3 
0.99952 


You can see from these results that the routines are quite accurate, since 
for most purposes only three significant digits is enough. Better approxi- 
mation could be obtained by using a smaller value for Z[3] and a larger 
value for 74]. 


Biometrika Tables* 


Ordinate Value Density Cumulative Distribution 
x Computed Table Computed Table 
| 0.24197 0.24197 0.15855 0.15866 
0 0.39894 0.39894 0.5 0.5 
1 0.24197 0.24197 0.84145 0.84134 
2 0.953991 0.95399 0.97729 0.97725 
2.5 0.017528 0.017528 0.99381 0.99379 
3.3 0.001723 0.001723 0.99952 0.99952 


* Values obtained from the Pearson, E. S. & Hartley, H. O., Biometrika Tables for Statisticians, 
Vol 1. (Cambridge University Press, 1962, p. 104). 


Higher Order Sample Moments are defined mathematically by 
D(x; — x)'/(N — 1) forr=2,3,... 


APL routines to calculate them are given on page 72. 
Sample Covariance is defined mathematically by 


Cov(X,¥) = 20x; — *) (vi — 9)/N 
= Lixyi/N — xy 
An APL routine to calculate it is given on page 73. 
Binomial Coefficient: 


(0) = wot 


where n ! is defined byn !=1xX2xX3xX4X%°-:: Xa. 
Factorial, ! (of an integer), (keyed by upper shift X, backspace, 
period): !W produces the product x/1iW 
Combinatorial, !, dyadic use of previous function: &!N produces 
the binomial coefficient (") defined above. 
The binomial probability distribution is defined by 


(") pi — py" 
for r successes in 7 trials, where probability of a success is p. An APL 
function to calculate it is given on page 75. 
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The Poisson probability distribution is defined by 
(e-4M*)/X}, X=0,1,2,..., ande =2.718... 


An APL function that calculates the probabilities is given on page 75. 

Maximum, [, (upper shift 5S): dyadic function which picks the 
larger of its arguments. 

Minimum, |, (upper shift D): dyadic function which picks the 
smaller of its arguments. 

Reshape, p, (upper shift 2): dyadic function which rearranges 
righthand argument into an array with the number of elements determined 
by the lefthand argument. 

Quotes ' ', (upper shift): entries between quote symbols are 
treated as characters, or literals, not as digits and numbers, or variable 
names, or APL functions. 

Logical functions (<, <, >,=, =, #): dyadic functions which com- 
pare left and right arguments. If the stated relation is true for the arguments 
compared, a 1 is produced by the function; otherwise a 0 is produced. 
Examples: 


4<3 


Logical functions (A, *, v,~): dyadic functions which produce 
binary results as indicated in the truth table on page 80. 


Compression, /: dyadic use of reduction. Left argument must be 
either 0, 1, or an array of 0’s, 1’s equal in length to righthand array 
variable. Output is an array of length equal to the number of 1’s and whose 
elements are the elements of the righthand array selected according to the 
position of the 1’s. 

EXAMPLES: 


1/1 2 3 


WeO BLL 2:23 


0/1 2 3 
01 0/1 2 3 


See pages 76 and 77 for an APL function that plots histograms. 
In the starred section, the normal density and cumulative distribution 
functions are calculated with APL routines. 
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Exercises 


APL Practice 


1. 


Let’s explore some of the uses of the logical functions. Let W+?20p5 
and K<-‘AN APPLE A DAY KEEPS THE DOCTOR AWAY’. 


(a) W=5 

(b) W < 14 

(c) W21 

(d) W=7 

(e) 'At=K 

(f) 'P'=K 

(g) 0 > W 

(h) Wso 

(i) 'K'#K 

(j) +/'#'=K How many E’s do we have in K? 

(k) +/W=3 How many 3’s do we have in W? 

(Il) Find the frequency of 2 in W. 

(m) F+(+/W=1), (+/W=2), (+/W=3), (+/W=4), (+/W=5), the row of 
absolute frequencies. 

(n) W='K! 

(0) W=K 

(p) X-.=3; the same result in (K). 


(q) +/W+.<3; the number of elements of W that are smaller than or 
equal to 3. 


(r) FR<«F+pW; a list of relative frequencies. 
(s) +\FR; the cumulative distribution of W where F is defined in (m). 


2. The Maclaurin’s series for the cosine of X in radians is given by 


0 Sama. Gana. Same, Gane. 
cosX = 97 —~artaqr7 @rt gr 


Use the relations 


sinX = V1 — cos? X 


sin X 
tan X = eoaX 
ee ee oe 

ota = tan X 


to find the sine, cosine, tangent and cotangent of 30°. (Hint: use only 
the first twenty terms of the series. 1 degree = 3.14/180 radians) 
Show that 

(a) )(-1)'() = 0 for N = 50, j=0,1,..., 50. 

(b) =3(%) = 2 for N = 50, j=0,1,..., 50. 
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* 5, 


(c) S870) = N- 2873 for N = 50, j= 1,2,..., 50. 

(d) 2-1) 4) =0  forN=50, j=1,..., 50. 

(e) ®) = SR9NC5*) ~~ forj = 0,1,...,20,r = 20,K = 30,N = 60. 
(f) @) = =a)? for N = 50, j=0,1,..., 50. 

(g) DiLe“4/jt +1 forL=5,j=1,..., 10. 


where + means ‘‘approximately equal.’’ 
(h) Ce) = (R*) + ED) 
(i) (&) Gi) / Gd) = GO) 40/4) 

for r = 25, M = 20, N = 15, K = 10. 


An interesting fact is that you can express the relational logical func- 
tions <,<, =, #, >, 2, in terms of ~, v, ¥, A, x, when the arguments are 
binary variables. For example, whenA and B are either 1 or 0, 4>Bcan 
be expressed as 4A~8. In order to see these relationships more clearly, 
write all of the relational logical functions in terms of ~, A, and v. 


Evaluate the integral 
F(x) = i x3dx, using X+(130)#10 


In general, for any two arraysA and B with equal numbers of elements, 
and any two binary functions ‘‘f’’ and ‘‘g’’, the expression Af - gB 
(the inner product of A and B) produces a scalar of the following form: 
(AigB,)f(A2gBo)f(AsgBs)f: + +. ForA —2356andB < 4556, try the 
following: 


(a) A+.xB (e) At+.!B 
(b) Ax.+B (f) A=.=B 
(c) At.=B (g) A-.=B 
(d) A=.+B 


Consider the function f(X) = 10X — X? where X+0 ,110 
(a) First find the integral [ SJ(X)dX algebraically. 


(b) Write a function that will calculate the same integral utilizing Sim- 
son’s Rule which is to be explained below: Simson’s Rule divides 
the interval (0, 10) into an even number of N subintervals, namely, 


5,X, 5,X, oo such that 6,X = X; — Xo = Xo = X15 b5.X = X3 
X_ = X, — X3. . . The area under the curve will be given by the 
sum 


a,x Ada) + 9K) + fC) 5 LU) + Hf) + f0 


+ 8X f(X4) + fie + f(iXe) . 


(c) Write a routine that will calculate the same integral utilizing the 
Trapezoidal Rule. 
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b 
Simpson’s rule for | F(xjdx 
a 


eo" 1 
7 
bd | x2 3 z, 


4 


5X 
—_—)P 


Figure 6.3 
Trapezoidal Rule: Divide (0, 10) into N equal intervals 5,X, 5,X, 
5,X, . . . Use the formula 
5, x Lo) 5 AK) us 5,.xL ew) fa) + 5x £2) wu a 76,91 % 
b 
Trapezoidal rule for i 
Figure 6.4 


(d) And lastly, use the usual rectangular rule, discussed in the text, to 
calculate the integral. The rectangular rule uses the formula 


5X > {(X)) + 5X » f(X2) + 53X - f(Xs) tue 


(e) Which method gives you the most accurate result? which the 
least? 

(f) Which method is more appropriate for 
(a) Convex functions? 
(b) Concave functions? 
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Figure 6.5 Illustration 
of a Concave and a 
Convex Function 


ye Convex Function yer Concave Function 


Statistical Applications 


1. 


2. 


Here is a function that gives you the Rth sample moment about the 
mean. 


V McR MNT X3N3M 
[1] N<p ,X 
[2] Me(+/x)tN 
[3] M<((X-M)*R)#N-1 
[4] Vv 


(a) Which variables are local? 

(b) Which variables are global? 

(c) Which variables are dummy? 

(d) Which variable is the result? 

(e) Define X<— -3 -2 -2 -1-1-1000011 1 223 and find the first 
four moments of X. 

(f) Delete the first —1 from_X to get a list of length 15 and repeat (d). 

(g) Compare (d) and (e). 

(h) Given that the first four moments of N(0, 1), a normal distribution 


with zero mean and unit variance, are 0, 1, 0, 3, compare the 
results in (d) with the population moments of an N(0, 1) variable. 


Use the binomial function (page 75) to solve the following problem. 
Thirty percent of the people in Barrington, Illinois, have blue eyes. 
In a random sample of 10 find the 
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(a) probability that exactly 5 have blue eyes. 

(b) probability that no more than 5S have blue eyes. 
(c) probability that fewer than 5 have blue eyes. 
(d) probability that at least 5 have blue eyes. 

(e) probability that no fewer than 5 have blue eyes. 


Let the variable X take the values +44, +14, +1 with equal probability, 
and let Y = X2. Verify that cov(Y, Y) = 0, i.e., X and Y are not 
correlated, though they are obviously dependent since Y = X?. 


The probability of getting the first success on the kth trial is given by 
f(k) = p( — p)*' (geometric distribution) where p is the probability 
of a success in single trial. If the probabilities of having a male and 
a female offspring are equal, find the probability that a family’s 
fourth child is their first son. 


Let f(X) = e-*, X > 0, be a probability function. Use the terminal to 
search for a value of X, call it X), such that fx*e-*dX = 4, ie., 
the probability that X¥ <X, is equal to 50%. (Hint: use 
X —(v100) + 100, the scan operator, and the rectangular rule de- 
scribed in exercise 7 (d), of this chapter.) 


Use the Poisson and the binomial distributions (page 75) to verify 
the following statement, ‘“‘The one distribution approximates the other 
whenever the sample size is very large and the probability of success 
is small.’’ In light of this statement find the probability that exactly two 
out of a 100 full-time traveling salesmen will be involved in a serious 
automobile accident during a year if the probability of one driver 
being involved in an auto accident during a year is .01. (See exercises 
7 and 8.) 


Write a function, name BI TABLE, that will construct the binomial 
tables for N = 2, 3,...,10,R=0,1,...,MN, and P = 0.05, 
0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 0.95. (Hint: use the N BI P 
function (page 75) and a conditional branching that will repeat the 
same step for different N’s and P’s.) 


Write a function, called POISSON TABLE, that will construct a table 
of the Poisson distribution probabilities for M = 0.01, 0.02,...,1, 
... ,9.9,10;N = 0,... , 10. (Hint: use N POISSON M, page 75, 
and the conditional branching to execute N POISSON M as many times 
as needed.) Note: if the sample size is very large, e.g., 200, and the 
probability of a success very small, e.g., P = 0.03, you cannot use 
the binomial because it involves a lot of calculations. Instead, you 
can use the Poisson with M = NP, e.g., M = (100) (.01) = 1, andN 
depending on the problem. 


Write a function that will solve the following types of problems. 

In a food processing and packaging plant there are on the average 
two packaging machine breakdowns per week. Assume that the 
weekly machine breakdowns follow a Poisson distribution and find 
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10. 


11. 


12. 


13. 
14. 
15. 


(a) The probability that there are no machine breakdowns in a given 
week. 

(b) The probability that there are no more than two machine break- 
downs in a given week. 

(c) The probability that there is at least one breakdown in a given 
week. 

(d) The probability that there is at most one breakdown in a given 
week. 


The sample coefficient of skewness is defined as a, = M,/5S°, where 
M, is the third moment about the mean and S is the standard devia- 
tion. Note: 


if a; > 0 the distribution is skewed to the right. 
if a, < 0 the distribution is skewed to the left. 
if a; = 0 the distribution is symmetric. 


Find the coefficient of skewness of X in exercises 1 (d) and 1 (e). 


The coefficient of kurtosis measures the ‘‘sharpness’’ with which a 
distribution peaks. It is given by a, = M,/S*, where M, is the fourth 
moment about the mean and S is the standard deviation. Find the 
sample coefficient of kurtosis for the list X given in exercises 1 (d) 
and 1 (e). 


Let X+0,110 be binomially distributed, with P=0.5and W=10. Use the 
binomial function (page 75) to find the sample probability distribu- 
tion and compare this distribution to the normal with respect to 
skewness and kurtosis. 


Repeat exercise 12 with X<+0,110, P = 4, and N = 10. 
Repeat exercise 12 with X<+0,110, P = 0.6, and N = 10. 


The joint probability distribution function of two discrete random 
variables X and Y is 


_ foxy foOs=x<4, Ilsys5 
fx, y) = { 0 otherwise 
where x = 0, .1, .2,...,3.0,...,3.9, 4.0 andy = 1, 1.1, 1.2, 
...,3.0,..., 4.9, 5.0. 


Let X<(0,140)+10 and Y«( ¥21)/¥«(0,150)#10 


(a) Find c such that f(x, y) is a probability distribution function, i.e., 
Lt 2fcxy; = 1. 

(b) Find the joint probability PU = x = 3,2 < y = 3). 

(c) Find the joint probability P(x = 3, y = 2). 

(d) Find the f(x; y = 2), i.e., the conditional probabilities of x when 
y=2., 
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(e) Find the conditional expectation E(x; y = 2), i.e., find the condi- 
tional mean value of x given y = 2. 

(f) LetZ<—2+(3x X)+4x X*2,(Z =2+4+ 3X + 4X’). Find 
E(Z; Y = 2), i.e., the mean value of Z given Y = 2. 

(g) Find E[(X — E(X))*, Y = 2], i.e., the variance of X given Y = 2. 

(h) Find the standard deviation of X given Y = 2,i.e., the square root 
of your answer to (f). 


16. Refer to exercise 15 and note that for a given value of Y we geta 
vector of probabilities of X hence the corresponding values for 
E(X) and the variance of X. Find the variance of ¥ for Y = 2 and 
Y = 3. Are the two values the same? 


17. The following table shows the number of days in a 50-day period on 
which X automobile accidents occurred in a city. 


Number 
Number of of Days 
Accidents f 
0 30 
1 14 
2 3 
3 2 
4 1 
5 0 
6 0 
50 


Compute: 

(a) The mean value of the sample (X). 
(b) The variance. 

(c) The standard deviation. 

(d) The coefficient of skewness. 

(e) The coefficient of kurtosis. 


(f) Assume that we want to compare this sample distribution to the 
theoretical Poisson distribution with Y = prob (X accidents) = 
M*e-“/X!, where we replace M, the mean value of the theo- 
retical distribution, with the sample mean X. Find Y forX = 0, 1, 
2, 3, 4, 5, 6. Find the theoretical number of accidents (i.e., 50 
times Y). 

(g) The theoretical variance is equal to M (which in this case is 
estimated by X). Is the sample variance equal to X? 

(h) Why might we wish to compare the sample distribution of acci- 
dents with the theoretical Poisson distribution? 


18. Put the H7ST7 function (page 76) into your workspace. Let F+(135), 
(¥135), and then try 
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(a) 1 HIST P 
2 HIST PF 

3 HIST F 

10 HIST P 


What is the difference between these 4 histograms? 


(b) Draw the histogram of the number of accidents vs. the number of 
days from exercise 17. 
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7.1 Numeric and Character Arrays 


Review Numeric 
Arrays 


We begin this chapter with a brief review. In Chapter 2 you learned how 
to store numeric data in a variable name. You used the left-pointing arrow 
or the specification function, «; for example, typing 4«1,2,3.4 followed 
by EXECUTE or RETURN stores this array of four numbers in locations 
which can be referenced when needed by typing A. In APL you can sepa- 
rate the numbers by spaces (blanks), as we have been doing, or by com- 
mas. Thus you can specify the array by 


Aet, 2,3,4 
A 
1234 


This specification is made possible by using the comma (,) which plays the 
role of the (primitive) catenate function (see pages 41-42). 

Next, suppose you want to add more data to the A array, for example, 
data contained in array B. 


B+ 56789 10 

A-«<A, B 

A 
12345678 9 10 


You have joined two vectors by using the catenate function. Rather than 
joining the whole vector, you might want to catenate some elements—-say 
the first, third, and sixth elements. You could type 


A+A, Bl1i, 3, 6] 
A 
123456789 10 5 7 10 
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Review Character 
Arrays 


DOMAIN ERROR 


LENGTH ERROR 
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A« A,BL1,3,6], would give the same result, and is more usual and per- 
haps quicker. Remember that our /ast definition of A was 4 ~<A, B,so the 
above operation is equivalent to 4«4,28,211,3,61. You can display ele- 
ments of an array in the same way. For example: 


AL1 3 10 4] 
13°20 4 
or 

B[6 6211] 
10 10 6 5 5 


So far we have stored and retrieved numerical data. We can store 
character information with the same commands. To store the alphabetic 
string ECONOMETRICS in vector C, we use the function quote ' ' (upper 
case XK). Type 


C + 'FCONOMETRICS' enclose the string of characters 
; with a single quote at each end 


ECONOMETRICS 

You could also store the characters of a course name and number: 
C « "ECONOMETRICS 801! 
C. 

ECONOMETRICS 801 


Now suppose that you wanted to alter the string, changing just the course 
number. You might try 


CL2). 802 
This would yield 
DOMAIN ERROR 
CL2] = 802 
A 


or, you might try 
CL2] se 1802 

and you would get 

LENGTH ERROR 


CL2] « '802' 
A 


Let’s see what has gone wrong. How many elements are contained in C? 


Display Numeric and 
Character Data on 
One Line with ; 


7.1 Numeric and Character Arrays 99 


You can ask the computer: 
pc 
16 


Each letter, each number, and all blanks (in this case one) are separate 
elements. Remember that the » command tells us the ‘‘shape’’ of an 
array—the number of elements that it contains. Also recall that '801' is not 
a number, but is three characters, viz., 8, 0, 1. We are not able to perform 
arithmetic with characters. 

When you attempted to replace the second element of the character 
vector C with the number 802, you were told that numbers are not in the 
domain of characters. The attempt to enter the three characters '802' into 
the one character element C[2] caused a length error. 

To change the course number, we could change the whole array: 


C + 'HCONOMETRICS 802! 
or just the last three elements 
CL14, 15, 16] « 'go2! 
or just the last element 
C[16] « "2! 
The result in C is the same: 
C 
ECONOMETRICS 802 
Suppose that you had a vector of course titles 
A «+ 'HCON MATH MONEY' 
and a vector of course numbers, say 
B< 802 801 800 


Keep in mind that 4 is a character array and B is a numeric array. Now, 
how would you display 


ECON 802 
You might try 

AC1 23 4), Bf1] 
and it would yield: 
DOMAIN ERROR 

A{1 23 4], Bf1] 


Unfortunately, you can’t mix characters and numeric data. However, 
there are ways around the problem. You can print or display mixed charac- 
ter and numeric data by using the semicolon ; (upper case comma on an 
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APL keyboard): 
A[1 23 44; BL1] 
FCON802 


It might be tempting to try to form an array which could be used as an 
array. Type 


D<+A[1 2 3 4]; B14] 
The computer displays 
HCON802 


and if you did not check to see the result you might believe that the array D 
contained what you want. Try 


D 
The computer responds 
ECON 


The numeric portion is lost. 

You might experiment by reversing the order and see if you obtain the 
same result. Thus, with mixed numeric and character data, we can use the 
semicolon to print the mixed array as output, but we cannot store it that 
way. 

It is possible to convert numerical data to character information with a 
command called Monadic Format. The symbol for the operation is 7; it is 
formed by overstriking tT (upper case J’) and little ©, called jot (upper case 
J). In our example, 


AL1 23 4], ¥ B(1] 
ECON802 


(+ transformed 2[1] into a character array and comma catenates the two 
character arrays.) You can form a new character array this way. 


D<«A{1 23 4), ¥ BL1] 
D 

FCON802 

But be careful. D is a character array, not a numeric array. Try 
D+t 

DOMAIN ERROR 


D+ 1 
A 


7.2 Entering Data Inside a Function 101 


7.2 Entering Data Inside a Function 


Entering Data 
Inside a Function 


Input Continuation 


Using »U 


When you read Chapter 4, you may have wondered how you can enter data 
into a function after the function has been defined. Until now, in executing 
functions, all the data have been stored in the machine prior to execution. If 
you want to enter data ‘‘inside’’ a function, that is, without first specifying 
arguments and then defining global variables to be used in the function, the 
following procedure does the job. 

Before we write a function that uses input in this special way, let’s write 
a function to compute the arithmetic mean. You did this in Chapter 4, but 
let’s do it again. 


V XBAR « MEAN X 
[1] XBAR « (+/X) + p.X V 
To use the function, we store values into the array labelled ZX: 
X «+ 10 30 5 71 15.2 
MEAN X 
26.24 
To do it again, we could enter new data—say, 
B<« 35 4.1 .2031 
X« xX, B 
MEAN X 
15.945 
or 
MEAN B 
3.0758 


Suppose that you had more data than could fit on a line. You could type ,0 
(upper case L, called ‘‘QUAD’’) after one of the numbers. For example, 


LONG+7 11 232 152,0 
Then press RETURN, and the computer responds: 
O: 
Then you could type 
5 21 31 68 
and press RETURN. The array LONG is 
LONG 
7 11 232 152 5 21 31 68 
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Numeric Input 
Quad-Input 
A+{] 
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MEAN LONG 
65.875 


Using ,0 gives you an easy way to input data into an array when the 
amount of data exceeds the line length of the terminal. 

With reference to our mean function, notice that first we changed the 
data and then we executed the function. Why not enter the data inside the 
function? To try this, enter the function 


V XBAR « NUMEAN 
[ty xX <0 
[2] XBAR « (+/X) + 9,X 
[3] v 

To execute NUMEAN: 
NUMEAN 

(you type, followed by RETURN or EXECUTE) 
[Je 


(the computer responds by printing quad, to indicate that it is waiting for 
your data.) You now type: 


13 8 13 14 12.732 
and the computer replies 
8.622 
Now try typing 
Xx 
13 8 13 14 12.732 
We now see that<«L enables us to define a global variable; X is global in the 


above function. NUMEAN can also be used with the previously defined 
global variables. Try 


PT PSS: OP B39 
NUMEAN 


which is the mean of Y. 

You might be thinking, ‘‘Nice, but so what? I have to type in as much 
information.’’ True, but what if we add the unconditional transfer? The 
program could be 


V XBAR<BETRYET 


Making a Program 
Interactive Using U 
and Branching 


Terminating Quad 
Input with > 


Character Input 
Quote-Quad Input 
A+ 
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(1) Xx] 
[2] XBAR«(+/X)*p ,X 
[3] XBAR 
C4] +1 
[5] Vv 
Now try it: 
BETRYET 
O: 
1245 12 
4.8 
O: 
2 .12 .034 34.1 71 
21.4508 
O: 


Wait! How do we stop it? The machine will keep asking you for input. You 
might try to type in such words as STOP, END, etc., but you will have no 
luck. Try using ‘‘branch,’”’ which is the right-pointing arrow > (keyed by 
upper case +). Now you are out of the function. 

You can see how we have developed an interactive program, i.e., a 
program that interacts with you by prompting you. It might be easier to 
calculate means in this new way, especially if you had a large number of 
separate data sets and you were unsure about exactly how many sets there 
were. 

What about character information? We can handle it in almost the same 
way, but we must use ‘‘quote quad,’’ [|] [type quad (upper case ZL), 
backspace, and then quote (upper case X)]. The new program might be 


VY BESTYET 
[1] I+ 0 
[2] AGAIN: T«I+1 

[3] DISP «+ J 

Cu] x <Q 

[5] XBAR « (+/X) + 9,X 

C6] 'THE NAME OF THE DATA SET IS ' 3; DISP 


[7] 'NUMBER OF DATA SETS READ THUS FAR ='3 I 


[8] 'THE MEAN = '; XBAR 
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QUAD and Quote- 
Quad Input 
Request: 
> for « 

and 
O backspace U 
backspace 7 

for 


Data and Information—-How to Get It In and Out 


[9] + AGAIN 
[10] Vv 

Try it: 
BESTYET 
re 


Notice that the response for [] is not the same as for]; here we get a 
‘‘blank’’ response! (Some systems respond with a blinking cursor, under- 
score, or color.) 

You name the data set by typing 


CHICAGO SMSA 


Then enter the data: 
33.7 27.3 31.4 84.2 33.9 
THE NAME OF THE DATA SET IS CHICAGO SMSA 
NUMBER OF DATA SETS READ THUS FAR = 1 
THE MEAN = 42.08 

And again: 

SAN FRANCISCO SMSA 


34.2 414.2 48.7 84.3 44.2 


THE WAME OF THE DATA SET IS SAN FRANCISCO SMSA 
NUMBER OF DATA SETS READ THUS FAR = 2 
THE MEAN = 50.52 

AAA 


Terminating Input 


How does one get out of the above program? You could type any charac- 
ter for the name. Then type > for the numeric data. To terminate the 
request for quote-quad input you enter 0, then backspace, U, then back- 
space, 7, and then press return. This will interrupt the execution of your 
function. To double-check, type 


)SL 


(SI = state indicator: indicates which routines are ‘‘suspended,”’ i.e., still 
trying to finish execution, and where the suspension occurs. A blank re- 
sponse by the computer means nothing is suspended.) 
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Clearly this is not the only, or even the best, way to handle this prob- 
lem. One advantage of introducing the problem in this way was that it gave 
you another useful ‘‘emergency’”’ tool in your APL tool kit. But how else 
might we handle this problem? Another method, which we have used 
before, is the conditional branch. We can instruct the user that when he is 
finished entering data to calculate means, he is to instruct the computer to 
finish the operation by typing in 


FINISHED 


Now all that we have to do is to insert after statement [3] a conditional 
branch that instructs the computer to leave this routine when it encounters 
the characters FINISHED. Consider the following: 


[3.5 ]58XITx18=+/'FINISHED' =8pDISP 
(10] BXIT:70 


When the character string FINISHED is read and stored in DISP, state- 
ment [3.5] tells the computer to go to the statement named EXIT. The 
latter statement simply terminates the function’s execution. If DISP does 
not contain FINISHED, then 'FINISHED'=8pDISP_ is an array of one to 
eight zeros. Applying +/ to this array (character by character) yields the 
number less than 8: the logical comparison 8 = some number less than 8 
also yields 0, and 10 gives an empty array. APL interprets ‘‘go to an 


empty array’’ as a statement to be ignored. 
Let’s proceed to alter BESTYET. 


V BESTYET! 30] 
C3J[3.5+RXITx18=+/'FINISHED' =8pDISP 
[3.6](10] EXIT:>0V 


Don’t dismay. We know that the editing of the function is not clear to you 
yet. You will learn exactly how to do it in the next chapter. Jt is presented 
here so that you can edit the function and understand the main points 
which concern entering data and branching. 

Now let’s retry the function. 


BESTYET 
NEW YORK SMSA 
OD: 

123 4 
THE NAME OF THESE DATA IS NEW YORK SMSA 
NUMBER OF DATA SETS READ THUS FAR = 1 
THE MEAN = 2.5 


DETROIT SMSA 
Lis 
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123456 
THE NAME OF THE DATA IS DETROIT SMSA 
NUMBER OF DATA SETS READ THUS FAR = 2 
THE MEAN = 3.5 
FINISHED 


& & & & 1. & 
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Saving Workspace 


) CONTINUE 


)CONTINUE : 
PASSWORD 


If you are working at a terminal, you can preserve your workspace so that 
your functions and variables will be available to you at your next session. 
Just type 


)CONTINUE 
and the computer will respond 
08:21:31 04/15/80 


indicating that your workspace was stored in your personal APL library at 
8:21:31 on 15 January, 1980. You now have stored one workspace in your 
private library, and the )CONTINUE systems command also disconnected 
you from the system. It is just as if you issued the )OF¥ command that you 
used in Chapter 1, with the exception that you have preserved your work- 
space in a private library. When we first introduced workspaces, we said 
that they could be thought of as a part of the computer that was allocated 
to the individual APL user. Now we can expand on that and think of a 
workspace as a file. A computer file is an electronic analogue of a manila 
file folder. So an APL workspace can be retrieved from the “file cabinet,”’ 
loaded, updated, saved, and erased. The name of the file is the name of the 
workspace. 

You can establish a password in the same way in which the )OFF com- 
mand was used. For example, 


)CONTINUE : FARM 


However, when you log-on, you must use the new password: 
)1984: FARM 
But now the system responds 
O62 08:16:01 01/16/80 
OPR: SYSTEM AVAILABLE TO 22:30 
SAVED 08:21:34 01/15/80 


)CONTINUE HOLD 


)WSID 
JWSID ID 


)SAVE ID 


)LOAD ID 
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The last line would have been CLEAR WS if your previous signoff was )OFF . 
But withthe ) CONTINUE command, your workspace is exactly as it was when 
you issued the command. You might like to know that the computer auto- 
matically uses this routine if your terminal is inadvertently disconnected 
from the system. When you subsequently reestablish communication, your 
workspace is also reestablished, just as if you had issued the )CONTINUE 
command. 
AS a matter of convenience, you may use 


JCONTINUE HOLD 


In this case the APL system recognizes the word HOLD, and the com- 
munications line to the computer is held open for approximately 60 sec- 
onds. This saves the next user the bother of reestablishing the communica- 
tion link with the computer. If you are going to use this command, it is 
recommended that the new user sit at the terminal and log-off the old user. 
This avoids a computerized version of musical chairs. However, don’t just 
leave your terminal on in the active mode, waiting for someone to log you 
off. You are probably being charged for this connect time. Also, someone 
could log you off with a new password known to him and unknown to you. 

It is possible to save a workspace without logging-off. The first step is to 
identify the workspace with a name. This is done with the )WSTD (Work- 
Space IDentification) command. 


)WSID MONEY 
WAS CLEAR WS 


Your workspace is renamed MONEY. The computer responded with 
WAS CLEAR WS, indicating the former name of the workspace. The next 
step is to actually save your active workspace. The command is 


)SAVE MONEY 


Your active workspace is saved in your private library under the name 
MONEY. After you issue the systems command )SAVF MONEY, the com- 
puter responds 


9:04:21 01/16/80 


indicating that the workspace was saved at 9:01:21 on 16 January, 1980. 
You can proceed with more computing after issuing the )SAVE command. 

To load the saved workspace from your private library into your active 
workspace, you issue the command 


)LOAD MONEY 


The active workspace (it could be a clear ws) that existed before the 
command was issued is replaced by the workspace named MONEY. The 
computer responds with 


SAVED 9:01:21 01/16/80 


telling you when the workspace MONEY was saved. Since it is possible to 
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)LEIB 


)SAVE CONTINUE 


COPY 


)PCOPY 
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save a number of workspaces, you may want to know the names of the 
workspaces stored in your private library. The command is 


)LIB 


and the computer responds 
CONTINUE 


MONEY 


This indicates that you have two stored workspaces in your library—one 
named CONTINUE, the other named MONEY. 

Whenever you issue the )CONTTNUE command, the workspace is stored 
under the name CONTINUE, This workspace is automatically loaded 
when you sign on. If you want to save your workspace and have it auto- 
matically loaded, you can execute the following command: 


)SAVE CONTINUE 


This command saves the active workspace under the special name CON- 
TINUE. It is as if you typed )CONTINUE, but you have not logged-off the 
computer. 

So far we have replaced the old workspace with the new one by the use 
of the }LO4PD command. Another way to update a workspace is to add one 
workspace to another or to add functions or variables to an existing work- 
space. This is accomplished with the )COPY command. For example, you 
may have a number of functions stored in one workspace and a number of 
data sets stored in another workspace. The data (variables) and the pro- 
gram (functions) must be in the same active workspace to perform compu- 
tations. Suppose that you have a function named REGRE, which was 
stored in workspace STAT. You could load it by 


)ZOAD STAT 


Suppose that the data are stored in the workspace named DAT under the 
variables names Y and X. You can add the data from DAT by 


)COPY DAT Y x 
The general form of the copy command is 
)COPY NAME ENTITY1 ... ENTITYN 


NAME is the name of the workspace, and ENTITY represents either a 
function or a variable. If the entity is omitted, all the variables and func- 
tions are added to the existing active workspace. When a function or 
variable conflict exists between the existing active workspace and any of 
those in the copy command—that is, the same variable (or function) name 
appears in both workspaces—the copy command takes precedence. The 
existing values are replaced by the ones in the copy command. If you want 
to be protected against possible unintentional conflicts or inadvertent re- 
placements, you can protect your existing workspace by using the 


)PCOPY 


)CLEAR 


)DROP ID 


Private Library 
) ERASE 
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command, which will not resolve any conflicts between the existing and 
copied workspaces, but will notify you of such conflicts. )PCOPY will only 
copy those entities for which no conflict exists. 

After you have continued, saved, loaded, and copied a number of work- 
spaces, it is a good idea to check what functions are in the workspace with 
the )F¥NS command, what variables are stored in the workspace with 
)VARS, and the name or identification of the active workspace with )WSID. 

You can clear the workspace with the 


)CLEAR 


instruction. You can drop a workspace from your private library with the 
command 


)DROP MONEY 
and the computer responds with 
9:12:07 01/16/80 


telling you when the workspace named MONEY was dropped from your 
private library. It is now impossible to retrieve that dropped workspace. 
Finally, to erase a function or variable you enter )#RASZ and the name of 
the item(s). You can see the result of this command by entering ) VARS or 
)FNS . 

You now know a few more ways to get data into and out of the com- 
puter. Many other ways are possible, but these basic methods will get you 
through many situations that you are likely to encounter in your work. 

Here is an example of a terminal session that might help you to review 
the concepts presented in this section. 


CLEAR WS 

VMEAN 
[1]  AVE<(+/X)z0X { function to compute mean 
[2] V 

VDATAGEN function generates 25 random 
[1] X*+2571000 numbers from 1 to 1000 without 
[2] Vv replacement. 

FNS We have two functions (DATAGEN 
DATAGEN MEAN and MEAN) and no variables in this 

) VARS active workspace. 

)WSID The name of this active workspace 
IS CLEAR WS is CLEAR WS. 

)SAVE CASH | We saved the workspace under the 
08:42:41 01/18/80 name CASH. 
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)CLEAR We clear the workspace. Computer 
CLEAR WS responds that we have a cleared 
)FNS workspace, and we have no variables 
)VARS or functions in the workspace. We 
)LIB have one stored workspace named 
CASH CASH. 
We copy the function Datagen into 
)COPY CASH DATAGEN the active workspace. Remember, 
SAVED 08:42:41 01/18/80 we have another function called 
MEAN in the stored workspace CASH. 
)FNS We check this and note that only 
DATAGEN DATAGEN was indeed transferred to 
)VARS the workspace. 
The name of the active workspace 
)WSID is not altered by th 
IS CLEAR WS is not altered by the copy com- 
mand. 
DATAGEN | Execute the function and list the 
result. 
Xx 


132 756 459 533 219 48 679 680 935 
384 520 831 35 54 530 672 8 67 418 
687 589 931 847 527 92 


JENS 
DATAGEN The effect of this has been to 

)VARS create a variable X in the workspace. 
Xx 

)SAVE COINS We save this workspace under the 
08:45:13 01/18/80 name COINS. 

)LIB 
CASH Note that the workspaces COINS 
COINS and CASH are not the same. 


We have executed the DATAGEN 
program again. The storing of a 
workspace does not change the 
contents of the active workspace. 


DATAGEN 


Xx 
654 416 702 911 763 263 48 737 329 
633 575 992 366 248 983 723 754 652 
73 632 885 273 437 767 478 


Saving Workspaces 
on Microcomputer 


Summary 
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)ZOAD COINS We load the workspace COINS into 
SAVED 08:45:13 01/18/80 the active workspace 


Xx { and list the values of X. 


132 756 459 533 219 48 679 


680 935 384 520 831 35 54u om that the loading of X values 


from COINS replaced the existing 


530 672 8 67 418 687 589 
values in the workspace. 


931 847 527 92 


The computer will not allow us to 
)SAVE CASH replace the values in a stored 
NOT SAVED, THIS WS IS COINS | workspace with those in an active 
workspace if the WSJDs are different. 
)WSID Our workspace name is COINS from 
IS COINS our last load command. 
\WSID CASH { We change the name of the active 
WAS COINS workspace 
)SAVE CASH and save it (thus replacing the 
08:49:00 01/18/80 previous version stored in CASH). 
)DROP COINS This removes COINS from our 
08:52:14 01/18/80 private library. 


)OFF 
LOG OFF 08:52:32 01/18/80 


END OF SESSION 


You can log-on and see that CASA is still stored on your private library. If 
you had logged-off with the )CONTINUF systems command, you would 
have a second copy of CASH in your private library under the name 
CONTINUE. It would be automatically loaded when you logged-on again. 

You are now at the stage where you can experiment with passwords and 
the )PCOPY command in order to see how they work in practice. Appendix 
D contains an explanation of one way to save your workspace on a mi- 
crocomputer. 


We reviewed the assignment of numeric and character data. DOMAIN ERROR 
was generated when an attempt was made to catenate the two types of 
data. LENGTH ERROR was generated when an attempt was made to assign 
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more than one character to an element in a character array. Data can be 
entered into the computer from an executing function. Numeric data are 
entered via |] and character data are entered vial. When you want to enter 
more data than can be held on a line of your terminal, enter , U and press 
return. 

You can save your active workspace by entering the systems command 
)CONTINUE. When you log-on the next time, this workspace, rather than 
CLEAR WS, comes up. By typing )COVTINUE:PASS you will have to use the 
password PASS when you log-on. Other uses of CONTINUE were also dis- 
cussed. 

Another systems command is )//S/D; it allows you to display and change 
the name of your workspace. )SAVZ ID allows you to save the current 
active workspace under the name /D. To use this workspace you enter 
)EO4D ID. You could bring part of this workspace to your CLEAR WS or to 
your active workspace by entering )COPY ID Y xX. This would bring the Y 
and X (either functions or variables) from workspace 7D to your current 
workspace. It is as if you loaded only x and Y. If you already had a function 
or variable named xX or Y, the )COPY command would cause the existing 
item to be replaced. The )PCOPY instruction copies those items for which 
no conflicts exist and informs you of conflicts. 

To clear your whole workspace enter )CLEHAR, to erase functions or 
variables from a workspace enter )ZRASE X Y, and to drop a workspace 
from your library enter )DROP ID. The remaining workspaces can be dis- 
played by )Z78. 


APL Practice 
1. Let the variables A, B, C, D be 
A«1 234 
B<+5 6 7 8 9 10 
CH'A BC Dt 
De'h PF 


(A) Try to predict the result before entering the following operations 
on the computer: 


a) A,B d) C.D 
b) A.B,B e) C3D 
c) A,BL3),404] f) C,C3Bl4] 


(B 


— 


Enter the following and predict the result. If you get an error 
explain why. 

a) A+4,B c) A«4,B[3],A[ 24] 

b) A<4,B,B d) C<C,D 
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e) C+C3D h) C+C&B 
f) C+C,C3BL4, 1) pB+B,B 
g) C<C,F BIS] 

Enter the following function: 

V MEAN<MEAN 

[1] HEADER 

[2] DATA] 

[3] MEAN++ /DATA+ 9 DATA V 


(a) Execute the function. 

(b) Use any name you choose for a header. 

(c) Compute the mean of:1 2 10 4 6. 

(d) Compute the mean of (7@8.3)+ 1 100. 

Display the names of all the functions in your workspace. 

Display the names of all the variables in your workspace. 

Display the name of your workspace. 

(a) Why is the function MEAN computationally inefficient? 

(b) How would you write it to make it more efficient? 

(c) What would be the effect of putting a comma between p and 
DATA? 

In line 10 of BESTYET what would the following produce and why? 

(a) > (c) +10 

(b) +0 (d) +20 

(a) Save the current workspace under the name STAT. 

(b) Erase all functions, variables and obtain a clear workspace. 


(c) Enter a function that computes the standard deviation using the 
result from MEAN. 


(d) Compute the mean and standard deviation of the 200 element 
array (01100), (23x100?100). 

How would you ask the computer to: 

(a) Copy the function CORR from the file whose name is STAT. 

(b) Copy the variable Y from the file whose name is DATA. 

(c) Copy the variable X from the file whose name is DA, given you 
already have another variable with the name X on your work- 
space. 

(d) Find the correlation of X and Y, using function CORR. 

(e) Store the answer in the file whose name is JE. 


Suppose that in a file you have saved the workspace STAT which 
contains some functions and some variables. Write down the neces- 
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11. 


sary commands that will allow you to add variable X into the file 
STAT. 


In the function BESTYET we added the line 
[3.5 ]+#XTTx18=+/'FINISHED'=8 pDISP 


to allow for a more orderly termination of the routine. Rewrite that 
line using a logical function rather than +/. 


Statistical Applications 


1. 


3. 


Write a routine that will do the following: 

(a) Asks you for a number from 10 to 100. Call it N. 

(b) Take a random sample of size N from the integers 1 to 5. 

(c) Calculate the sample mean; call it M1. 

(d) Repeat the sampling process from (a) to find a second mean called 
M2. 

(e) Find the mean and the variance of M1 and M2 [in this case 
(M1 + M2)/2], call it MM2. Find the sample variance of M1 and 
M2 [in this case ((M1 — MM2)? + (M2 — MM?2)*)/(2 — 1)], and 
call this $§2. 

(f) Repeat the sampling process from (a) to find anew mean called M3, 
and recompute the ‘‘average’’ mean and variance, say MM3 and 
SS3. 

(g) Repeat the sampling process up to the point where the difference 
of the sample variances of the MM,—M™M,_, is less than .001. How 
many samples does it take? 


Let the variables X¥ and Y be defined as X+Y¥xY<: ("4 ~3 “2 2 3 4) 

and the variables W<Y-X. Write a function to calculate the correlation 

between any two variables and use this function to verify that: 

(a) X and Y are not correlated (1.e., their sample correlation coeffi- 
cient is close to zero). 

(b) W and X are perfectly negatively correlated (i.e., their sample 
correlation coefficient is very close to ~1). 

(c) In general, how would you generate two discrete random variables 
that are perfectly negatively correlated? 


The joint discrete distribution of two arrays A and B is: 
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(a) Compute the marginal distributions of A and B. 

(b) Compute the conditional distribution of A given B = 10. 
(c) Are A and B independent? 

(d) Are A and B uncorrelated? 


One form of the central limit theorem states that if X,,i=1;...,N 
are independent random variables with mean yw and variance o?, and 
Z, = (X - uyVn/o is the standardized X;, then Z; is distributed as 
standard normal (i.e., Z; has mean value 0 and variance 1). Write a 
function that will give you a random sample X,, X2, °: : X, of size 30 
of a random variable which takes on values 1, 2, -:- 10, each with 
probability .1, and calculate z, = ((¥ — 5.5)/V8.25)) V30. Repeat the 
process nine times and calculate the sample mean and variance of 
Z;, 2, °°°* Luo: 

Write a function that will utilize the functions I NORMC X (page 85) 
and the function NORM D (page 84) to calculate the tables of the 
standard normal distribution for the values of —4 to 4 in intervals of .1. 
Compare your table to the standard normal tables. 


An economist who believes in cardinal utility had to decide on one of 
two dates. Date A would yield 40,000 utils* if successful and 4,000 utils 
if unsuccessful. Date B would yield 70,000 utils if successful and 10 
utils if unsuccessful. Chances of a successful date were subjectively 
estimated to be equal for A and B. The economist was in a quandary 
about what to do. (She was indifferent between the two choices.) So 
she decided to compute the probabilities of success that would make A 
the preferred choice, B the preferred choice, and A and B equally 
attractive on the basis of expected utility. Next, suppose that she 
could be assured of receiving 8,000 utils by remaining home and learn- 
ing more about APL. Which of the three alternatives would she now 
choose? 

Finally, assume that the probabilities were such that the expected 
utilities were in fact equal. However, the economist could spend time, 
and thus utils, in improving the chances of success in case B. In this 
case the probability of success in A might remain the same as before, 
and the chance of success in B would be larger than that in A. State a 
decision rule for spending utils to improve the probability of success. 
If she could buy an increase in the probability of success to 20% and 
reduce the probability of failure to 80% for date B at the cost of 6,000 
utils, should she do it? 


* A util is an arbitrary measure of satisfaction. 


More on Functions 


Thus far we have been very careful with the definitions of our functions 
and have cautioned you to be very careful in typing in the suggested APL 
expressions. Now is the time to become more adventurous in our writing 
of functions and to no longer worry unduly about making mistakes. One of 
the delights of APL for those of us who are used to programming in 
FORTRAN and similar languages is that correcting mistakes and errors is 
so much easier in APL than it is in the other computer languages. We all 
make mistakes, so it is reassuring to know that correcting our errors will 
not be difficult. 


8.1 Function Display, Correction, and Editing 
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You will find that you will want to be able to examine your function from 
time to time and, if it is a long one with many statements in it, you will 
often want to look at only a part of the function, not at all of it. Next, you 
will discover that you will need to alter, if not to correct, one or more lines 
in your function. So we need easy ways to display a function and to alter, 
add, or delete lines in it. 

The next most important task is to be able to figure out how and why 
your function does not work or does not produce the results it should. This 
brings us to the intriguing world of diagnostics. Let’s begin with the sim- 
pler task of correcting errors we know about, before we learn how to 
discover the ones we don’t know about. 

If your typing is like ours, you made a few typing errors when entering 
the function statements. As long as you caught them on the line on which 
they were made, prior to pressing the RETURN or EXECUTE key, they 
were easy to correct. We discussed a method of correcting such errors on 
page 25. However, if you discovered the typo after the function definition 
was closed off, that is, after the final ¥. you had troubles. On page 61 we 


Function Display 


Display One or 
Several Lines of a 
Function [#0] or 


[ 


#] 
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showed one way to correct the errors. Here are some more ways to modify 
your function. Let’s start with your old friend from the previous chapter, 
BESTYET. 

To display your function, type 


VBESTYETLOI V < you type 


C1] Io 

[2] AGAIN: I+I+1 

[3] DISP 

[4] +EXITx18=+/'FINISHED'=8 p DISP 
[5] xX] 
6] XBAR+(+/X)#pX computer responds 


[7] 'THE NAME OF THESE DATA IS ',DISP 
[8] "NUMBER OF DATA SETS READ THUS FAR = '31I 


9] 'THE MEAN = ';XBAR 


[10] +AGAIN 


[11] FXIT:>0 


V 


As you may remember, this is how to obtain a display or listing of the 
whole function. Should you want to display only one statement in the 
function, for example, number three, then you can type 


VBESTYETL 30] V 


and the computer prints 
[3] DIsP+) 


What you have done is to open the function by typing VBESTYET. State- 
ment [3] is displayed by using the J (quad) symbol, and the function is then 
closed with the final V (del). If you had typed 


V BESTYET(IO3I V 


you would have received all the function statements from [3] to the end of 
BESTYET. 
Suppose you want to modify a single statement, say [7]. You can type 


VY BESTYETL7] ‘THE NAME OF DATA SET IS ';DISP V 


This will replace the former statement number 7 with this new one. An- 
other way to modify a statement is 


V BESTYETL8U] < you type 
[8] ‘NUMBER OF DATA SETS READ THUSFAR = ';T «— computer responds 
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More on Functions 


This is the first time that we have not closed the function. You have 
opened the function, the computer has displayed statement [8], and you 
are now able to modify it by typing 


[8] 7;'DATA SETS HAVE BEEN READ' <- you type 
{9] <— computer responds 


Now you must close the function by typing on line [9] the symbol V. 
Yet another way to modify your function is to display the function and 
omit the closing del. 


VY BESTYET(LOI 


This will list your function and allow you to add a statement at the end of 
the function, or to modify other function statements or to insert new 
statements in your program. Let’s start by adding a statement at the end. 
After listing the entire function, including the final V, the computer will 


type 
[12] 
and you can add: 


[12] 'THIS STATEMENT CANNOT BE REACHED! 


Next we could modify statement [9] by 


C13] [90] <— you type to display line 9 
[9] 'THE MEAN =';XBAR <— the computer responds 
[9] 'THE ARITHMETIC MEAN =';XBAR < you type a correction 


[10] <— the computer responds with 
the next line number 


To insert a statement between statement [2] and statement [3] you would 
type 

[10] (2.2]'ENTER THE NAME OF THE DATA SET‘ 

and you could add 

[2.3] 'TATS STATEMENT HAS NO PURPOSE! 

[2.4] V 


Before we delete the last statement we just inserted, we should list the 
function: 


| sae | 


VBESTYETL 


VBESTYET 
[1] <0 
[2] AGAIN:I<I+1 


Statements 
Automatically 
Renumbered 


Deleting a Line in a 
Function 
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[3] 'ENTER THE NAME OF THE DATA SET! 
C4] ‘THIS STATEMENT HAS NO PURPOSE' 


ES) DSP 
[6] +EXITx18=+/'FINISHED'=8 po DISP 
C7d) xe 


[8] XBAR+ (+/X)+oX 


'THE NAME OF THE DATA SET IS';DISP 
[10] I;'DATA SETS HAVE BEEN READ! 

[11] 'THE ARITHMETIC MEAN = ';XBAR 

[12] >AGAIN 

[13] #XIT:>+0 

[14] 'THIS STATEMENT CAN NOT BE REACHED! 
Vv 


[15] 


(Notice that the statements we entered as [2.2], [2.3] are renumbered as 
[3], [4], and all subsequent line numbers are increased by two: another 
reason for using line /abels in branch statements!) 

The last (useless) statement we added is now Number 4; we added it in 
order to show you how to delete a line. On some systems we can type 


[15] [Ay] 


On other systems this may not work. Instead of [A4], type [4] and hit 
LINEFEED followed by EXECUTE or, on some terminals, ATTN fol- 
lowed by EXECUTE. 

If we decided that we want to prompt the user of this function when data 
are entered, we might insert 


V BESTYET [140] 


[144] (€4.1] 'ENTER DATA, SEPARATE OBSERVATIONS BY BLANKS AND 
PRESS EXECUTE AFTER LAST SAMPLE! 


[4.2] V 


Display the function as rewritten once again, and note the renumbering of 
the statements. Renumbering would cause problems if your branch state- 
ments referred to a specific line number, so that is why it is best to label 
statements that receive branches. 

For example, consider this very simple function and its change. Enter 


V TRIAL 
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f1] +2 <— first statement instructs computer 


‘ + ‘ 2 
[2] tEND' ¥ to execute ‘END’ by branching to [2] 


Now alter the function as follows: 


Vv TRIALL 30) 
3] (1.1J4+1 


[1.2] V 
V TRIALLD) V 

V TRIAL 

f1] +2 
After change, lines are renumbered 

[2] Ae4 but the branch statement is not, so 
[3] tpt the instruction in [1] is no longer 

9 correct. 


Now try using the function with the data as before. You can treat the 
function header as if it were another statement with line count 0 (re- 
member: zero—not alphabetic ‘‘oh’’). 


VBESTYETLOO] < you type 

< computer responds—note the next 
number is not put in until you 
hit RETURN 


[0] BSTYET < you type 


[0] BESTYET 


[1] Vv 


We have renamed the function. We could also change the ‘“‘type’’ of the 
function by altering the header in various ways. For example, 


VBSTYET(OO) 
[0] BSTYET 
[1]£0] XBAR<BSTYET 
[1] Vv 


To erase the whole function, type )ERASE BSTYET. To reassure yourself 
that it is gone, type 


BSTYET 
VALUE ERROR 
BSTYET 

A 


or type )FNS and check to see that BSTYET is no longer listed. 
After you have gained a little practice you will find that your ability to 
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edit your functions will become second nature. Of course, it is all very well 
to be expert at editing your functions and correcting errors if you know 
what has to be done. The real challenge is finding out what went wrong 
with your function; even more challenging are those cases wherein you are 
not sure anything is wrong, but it might be! We now come to computer 
detective work: how to find errors and mistakes. As in all detective work, 
there are some rules to know and some recommended procedures to fol- 
low, but after that you are on your own in learning the arcane art of 
computer error detection. Let’s begin with some useful procedures. 


8.2 Diagnostic Procedures 


Diagnostics Define the function 
V NEW<ECON xX 
[1] WEWepxXz(+/(1+X)) 
[2] GEO<(x/X)*(1+pX) 
[3] Vv 


What are the expected results? When the vector X containing 2 4 8 is 
executed, the harmonic mean should be 3.4286, and the geometric mean 
should be 4. 

What does the computer produce? 


X*2 4 8 
ECON X 
3 
Is this satisfactory? Of course not, but what about the variable GEO? 
GEO 
m 


This is OK, but is this ‘‘test’’ adequate? The latter question is not easily 
answered in general. First, let’s alter ECON to print GEO every time it is 
executed. 


Vv ECONO) 


VY NEW+ECON -X 
[1] NEW+o9 X3(+/(1:X)) 
[2] GEO<(x/X)*(1%0 X) 
V 
[3] Ukcko Vv 
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ECON X 


Is this the order you expected? If not, remember that when the function is 
executed, GEO will be printed at line [3] and then NEW will be printed 
upon completion of execution. 

Let us now work through our routine one step at a time in order to figure 
out what we have done wrong. 


1. Let’s write the mathematical expression we want. 


NEW = j —* 
—+-— 


1 
. + —— 
X, X2 Xn 


for N observations on vector X. Compute the answer by hand: 


NEW = D1 1 = 3.4286 
tats 


2. Are the data correct? 


Yes 


3. Now that we are reasonably sure that the error is in line [1], let’s find 
out exactly where it is. Try doing each part of the line separately: 


a. -(1:X) 
0.5 0.25 0.125 
b. (+/(1:X)) 
0.875 
c. oX 
3 
d. oX#(+/(1#X)) 
3 


Now we have found a problem. Two components—each apparently 
correct—fail to produce the correct result when combined. Remember that 
the computer operates from right to left. So let’s reexamine the entire line. 
If you do, you will recognize that you need parentheses around pX. 
Consider 


Xt(+/(1#X)) 
2.2857 4.5714 9.1429 


Trace; TA 
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oX+(+/(14X)) 
3 

The function is producing a result which we would obtain if we had written 
po (X#(+/(14X))) 


So all that we did in line [1] was to find the number of elements in the string 
(X + (+/(1 + X))). What we really want is 


(oX)+(4+/(14X)) 


3.4286 


The Trace Function 


Now let us suppose that our function is a little more complicated and 
that we do not know where our problem initially occurs. In order to locate 
the source of the error, we can use the ‘‘trace’’ function. For example, 
suppose we want to print out the intermediate results in a routine called 
DSTAT. In particular let us suppose that we want to examine the results of 
the computations performed at lines [1], [2], and [4]. We can do this by 
typing 

TADSTAT+1 2 4 


where A is upper case H. This allows us to see the results of each statement 
listed. The numbers on the right of the « are the statements we want to 
trace. In order to implement the trace operation, having first specified what 
is to be traced, we merely execute the function. Before doing that, let us 
display the function DSTAT, which we defined in Chapter 5 and which you 
might have stored in a file after reading Chapter 7. 


VY DSTATLOI V 


VY DSTAT X 
[1] R<(MAX+«X[p XJ) -MIN<(X+XT4XJ)01) 

[2] SD<(VAR<(+/ (X-MEAN+<(+/X)+N)*2)+ (N* p X)-1)*0.5 
[3] MD<(+/|X-MEAN)=N 

[uJ MED<0.5x+/XU0((N52),1+L Ne2] 

[5] "SAMPLE SIZE" 

[6] wW 

[7] ‘MAXIMUM! 

[8] MAX 

[9] ‘MINIMUM! 
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10] MIN 

11] 'RANGE' 

12] Ff 

[13] "MEAN! 
[14] MEAN 

15] "VARIANCE! 


[16] VAR 
{17] "STANDARD DEVIATION 


[18] SD 


(19] 'MBAN DEVIATION! 
[20] MD 
[21] "MEDIAN! 


[22] MED 
V 
We need an array of numbers on which to operate, so let us define 
W<9 10 2812015 
DSTAT W 
DSTATL1] 12 


DSTATL 2] 42517821852 tae by the 


Trace function 
DSTATL4] 6.5 


SAMPLE STZE <—begins the programmed output 
8 
MAXIMUM 


12 


MINIMUM 


12 
MEAN 
5.875 


VARIANCE 


Stop; SA 
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20.41071429 
STANDARD DEVIATION 
4.517821852 

MEAN DEVIATION 
3.875 

MEDIAN 

6.5 


What the trace function does is to print out certain intermediate values 
in a routine, as specified by the array in the statement 
TAFUNCTION NAME*ARRAY. What is printed on each line is the final (left- 
most) variable defined by the operator +. That is why, in tracing line [1], 
we get R (the range), but neither MAX nor MIN. 

The trace function will remain in force until you remove it. This is done 
by typing in 

TADSTAT<10 


That is, we instruct the computer to trace the DSTAT line numbers con- 
tained in an empty array. 
Before leaving trace, try: 


TADSTAT<0 


The Stop Operator 
Another very useful diagnostic tool is the stop operator, which is used in 
a manner similar to the trace operator. The stop operator is activated by 
typing 
SADSTAT+1 4 
and used by calling 
DSTAT W 
DSTATL13 


What happens is that when DSTAT is executed, the routine stops automat- 
ically at the line before the one indicated. In our example, the function 
DSTAT is ready to execute the first line, but has not yet done so. When a 
routine is suspended in this way, you can do a host of other calculations, 
print results, etc. The function remains suspended while these other opera- 
tions are being performed. 

Try typing in 


R 
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VALUE ERROR 


MAX 


MIN 
VALUE ERROR 

MIN 

A 

x 


9102812015 


These results show that because the routine has been stopped before R, 
MAX, and MIN have been defined, we get value errors if we try to display 
them. The output also shows we are inside the routine, since the dummy 
variable X contains the array W. 

In order to execute line [1], we type 


>1 
DSTATL4] 


and not only line [1], but also lines 2 and 3 will be executed. However, line 
[4] will not be executed, since we have a stop at [4]. If we had typed +2 
instead of +1 above, execution of line [1] would not have been carried out, 
and so the variables R, MAX, and MIN would still not be defined. The 
instruction +2 instructs the computer to go to line [2], execute it, and then 
continue, executing line by line, to the next stopped line. 

We happen to know that at the moment our routine DSTAT is suspended 
just before line [4], but after a confusing terminal session, we may not be so 
sure of ourselves. Fortunately, in APL we have an easy way to find out. 
Type in the system command 


)ST 
DSTATL 4 1* 


SZ stands for state indicator; it tells us which routines are currently sus- 
pended, as well as where the suspensions have occurred. As you see from 
our example, the position is given by the next line to be executed. That is, 
line [4] of DSTAT is the next statement to be executed. 

You may wonder: Why the asterisk, *? The reason is that it distinguishes 
between suspended functions and pendent functions. Any APL expression 


SIV 
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can include another program function. When an APL statement cannot be 
completed because a program function it called is suspended, the former 
function is called pendent and the latter is called suspended. If statement 
[2] in function A used function B, and statement [3] in function B used 
function DSTAT, which contained an undefined value at statement number 
[1], then the state indicator would be 


)ST t 
DSTAT[11* 

BC3] 

AL2] 


Suspended functions have a * and pendent ones don’t. 

Both suspended and pendent functions can cause you a lot of mysterious 
errors if you are not careful, so the best advice is to get rid of them as soon 
as possible. Before we show you how to do that, we will point out that 
functions usually get suspended not by the use of stop commands, but 
because an error is encountered which halts execution—for example, a 
length, or value, or domain error. 

Consequently, if your routine is stopped in its execution by an error 
and you decide to reexecute after putting in a stop or a trace command, 
remember first to get rid of the suspended and pendent functions. 

This is done very simply. Type in 


> 


and repeat it as many times as is necessary to get a blank response to your 
query 


)ST 


System Command )SIV 


Another useful system command is )SIV. This command provides the 
same information as )S7, but in addition it gives you a list of the dummy 
variables that appear in the header of the suspended function. 


A Case Study in Program Development and the 
8.3 Location and Correction of Program Errors 


Debugging 


In order to give you some idea of the problems involved in finding and 
removing errors from a program, a process which is known as ‘‘debug- 
ging,’’ we are going to work through a case study of writing an APL 
routine for a ‘‘live’’ practical problem. 

As you will soon see, the way in which this program was written is a 
classic example in many respects of how not to write a routine; however, 
it is a simple example, and at the moment that’s what counts most. 
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This example concerns the effectiveness of a new drug for the treatment 
of duodenal ulcers. As part of the study the question was asked, ‘‘Does 
this drug have an effect on the work hours lost due to the disease?”’ 
Medical researchers collected data and decided to test the hypothesis 
using the Student ¢ distribution. Since the researchers had access to APL, 
they decided to write an APL program to compute the ¢ statistic. Here is 
how they did it. 

First, they went to a textbook which contained the formula for the 
Statistical test they wanted, as well as a test problem with a computed 
result.* The equation for the test statistic they used was: 


x -— X, 
[(Ni = Dsi + (Ne = Dst [1 1 
N, + N, aa 2. N, Nz 


where X,, X>, 53, 53, N,, No, are the respective sample means, variances, 
and sample sizes. 

In the textbook problem, two small random samples were drawn—one 
from freshmen, the other from seniors—of the amounts of money that 
these students had on their persons. The null hypothesis was that the 
average amount of money held by freshmen was equal to that held by 
seniors. The alternative hypothesis was that the two means were unequal. 
The following statistics were cited: 


t= 


Variable Symbol Freshmen Symbol Seniors 
Average Amount of Money Xe $1.28 X $2.02 
Standard Deviation S, 0.51 Se 0.43 
Number of Observations N, 10 Nz 12 


The critical value of the ¢ statistic at the 5% significance level for (N, + 
N, ~ 2) = 20 degrees of freedom is +2.086 for a two-tailed test. If the 
computed ¢ value is larger than the positive or smaller than the negative 
value, we would reject the null hypothesis at this level. 

Here is a listing of the program written by the medical researchers: 


V PTTESTLO) V 


Vo TTTEST 
[1} SS<S1*2 
[2] SSS<S2*2 


[3] NoW1i-1 
C4] NTN2-1 
[5] A<((NOxSS)+(NTXSSS) )+N1+N2-2 


[6] A<A*0.5 


C7] B<€(1#N1)+(14N2))*0.5 


* The text was Hamburg, Morris, Statistical Analysis for Decision Making (New York: Harcourt, 
Brace and World, Inc., 1970), pp. 347-348. 


How Mistakes 
Begin 
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[8] C+AxB 
C9] P<(X1-X2)+C 
V 


The first two statements convert the standard deviations provided in the 
examples to the variances. The next two statements each subtract one 
from the number of observations. Statement [5] computes the ratio in the 
first radical; number [6] takes its square root. Statement [7] computes 
the value of the second radical and number [8] computes the denominator 
of the ratio. The last statement computes the tf statistic itself. 

Notice that they took each step and broke it down into small parts, each 
part getting one line. This procedure creates more lines, but it makes each 
line very easy to understand. They could have put many of these steps on 
one line. However, if one does that, then locating possible errors can 
become more difficult. 

The next step was to store the test data in their active workspace: 


X1+1.28 
X2<2.02 
S1<0.51 
N1<10 
S2+0.43 
N2<12 

and execute the function: 
TTEST 

“3.6953 


Since the result was the same as that reported in the text, the medical 
researchers concluded that the program was functioning correctly. The 
next step should have been to compute other test values and see if they 
worked, too, but the researchers didn’t do this. 

Next the researchers decided that they would have to compute the stan- 
dard deviation themselves from their raw data. To accomplish this, they 
constructed a second function, although the routine could have been in- 
corporated in the 7TEST program. The textbook formula for the standard 
deviation was 


_ | 21% - XP 
STDEV = Wo 


and their APL program to do this is listed below. 
V STDEVLO] V 
V S1«STDEV X 
C1] XBAR+(+/X)+oX 
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[2] SA« +/(XBAR®.-X)*2 
[3] S1«(SA*0.5)#(pX)-1 
Vv 


The next step was to revise the TTEST program in order to incorporate 
this calculation. Here is that revised program: 


VY TTEST(OI V 


V T<TTEST 
[1] STDEV X 
[2] No«(p X)-1 


13 X1<XBAR 
C4 J SSS1*2 
ies) STDEV Y 
NT<(p Y)-1 
X2«XBAR 


8] SSS*«S81*2 
[9] A+«((NOxSS)+(NTXSSS))+N1+N2-2 


[10] A<Ax0.5 
[11] B<((12N1)4+(14N2))*0.5 
[12] C<AxB 
[asd Peer =C 
V 


Let’s go through this. Line [1] executes the standard deviation func- 
tion. Line [2] subtracts 1 from the first sample size, line [3] renames the 
first group mean (so that its value will not be lost in line [5]), and line [4] 
computes the variance from the standard deviation computed in STDEV. 
Lines [5] through [8] repeat the process for the second group of data. 
The remaining part of this function is the same as before. They entered a 
test set of data and executed the program: 


¥+10 20 30 40 50 
X*1 23456789 10 
TTEST 

1.0092 

7.9057 


“12.981 


OUTER PRODUCT 
(°o.g) 
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The standard deviation and the t value are printed. The values obtained 
seemed to be in the ball park, so they proceeded to the real data. 

Even at this point in our discussion you can see that the medical re- 
searchers have gotten themselves into some peculiar programming, be- 
cause it would appear that they did not think ahead and plan out what they 
wanted to do. For example, they wrote a separate routine to calculate 
standard deviations, which they promptly resquared in TTEST. Also, if 
they had considered more carefully which variables should be local and 
which global, they would have been able to avoid having to store XBAR 
into X1 and X2 and then taking the difference. 

There is one operation in this routine with which you are not yet famil- 
iar; that is the outer product operation (° . g} which appears in line [2] of 
STDEV. The symbol g represents any dyadic operator, and outer product is 
always used in the dyadic mode. The operation (XBARo.-X)*2 produces 
an array, each element of which is (X; — XBAR)?. The same result could be 
obtained by +/(XBAR-X)*2. The latter approach to programming might be 
preferable to the former in that it is both easier to understand and on some 
systems is quicker to execute. The outer product operation (°.—) is a 
most useful tool, but is probably ‘‘overkill’? for our simple needs in this 
routine. 

The data that the researchers used are: 


TREATMENT 
413455552341506235555255354 

55113545155550 

0 TREATMENT 
4.0 

CONTROL 
2.55022.515535553355404030513 

13525012 

0 CONTROL 


33 


indicating the number of work days lost for each patient in the two groups 
before the medication was administered. The researchers had to store 
these data under the names X and Y, since the programs as written can use 
only arrays X and Y as input (another reason for thinking ahead and con- 
sidering carefully the use of dummy variables in the header). 


X*TREATMENT 
Y<CONTROL 
Upon executing the program, they obtained 


TTEST 
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0.27425 
0.32809 
9.1397 
and printed the means: 


X1 


X2 
2.9094 


The means were correct (this was known from previous work), but the 
value for t of 9.1397 seemed to be inordinately high. Now they became 
suspicious. Were the data wrong? Was this acorrect, even though surprising, 
result? Or was there something wrong with this patchwork programming? 

The ¢ statistic function looks good in the sense that it produced results 
that were corroborated from an outside source, but they did alter it after 
the test. However, the standard deviation program was not checked in the 
same way. The program was so simple that they did not run any test data. 
The mistake they made is an example of a very common type of error. Let 
us check the routine out with a simple data set that we can easily compute 
by using the calculator or immediate execution mode of APL. 


X*+110 
x 
42345678 9 10 
STDEV X 
1.0092 
XBAR 
5.5 


The mean is correct, but the standard deviation is wrong! The hand calcu- 
lation is Standard Deviation = 


+/(XBAR-X)*2 
82.5 

(82.579.0)*.5 
3.0277 


We now know that there is a problem, but not where that problem occurs. 
So let’s use our trace function. 


TASTDEV+1 2 3 
STDEV X 
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STDEV[1] 5.5 
STDEVL2] 82.5 
STDEV(3] 1.0092 
1.0092 
Line [1] we have already checked by hand, and we also find that line [2] is 
correct. So the error must occur in line [3]. Line [3] is 
[3] S1«€ SA*0.5)#(p X)-1 
Rethinking through this line from right to left, we discover that they com- 


puted (V=(¥; — X)2)/(N — 1) rather than (V=(X; — X)?/(N — 1)). We can 
correct this error by editing STDEV: 


V STDEVE 30] 


[3] Si+« ( SAx0.5)#(p ¥)-1 

[3] Si ( SAt((p X)-1))*0.5 

c4] Vv 

Upon executing the revised function, we obtain 
STDEV X 

STDEVL1] 5.5 

STDEVL2] 82.5 

STDEV{.3} 3.0277 


3.0277 


which yields the correct answer. 
Remember that the trace function will stay in effect until we retract it 
with the following command: 


TASTDEV+10 


Our next step is to retry the 7TEST program with the new STDEV 
program. We generate a second data set Y and run the program. 


x 

12345678 9 10 
¥«10 20 30 40 50 
TTEST 

3.0277 

15.8114 


~380.03 


Our hand calculation for the standard deviation of Y is correct, but the 
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value for ¢ of — 139.16 is wrong. Our hand calculation is 


_ 5.5- 30 _ 
V83.269 V0.3 


Consequently, TTEST, which was thought to be correct, is in fact in er- 
ror. Let us compare a trace on J7TEST run before correcting STDEV with 
one run afterwards, and see if we get some clues as to where the error 
occurs. 


— 4.9019 


TTEST 
STDEVL1] 5.5 
STDEVL2] 82.5 
STDEVL 3] 1.0092 
TTEST([1] 1.0092 
TTESTL2] 9 


TTEST(3] 5.5 


TTEST[4] 250 

STDEVE1] 30 

STDEVL2] 1000 

STDEVL3] 7.9057 

TTEST(.5] 7.9057 

TTEST(6] 4 

TTESTL7] 30 

TTEST[8] 250 

TTESTL9] 9.5588 

TTEST(10J] 3.0917 

TTEST[11] 0.180889 

TTEST(12] 0.33667 

TTEST[13] ~72.771 

~72.774 

Next, we run the trace function on our program with the corrections: 
TTEST 

STDEVL1] 5.5 

STDEVE2] 82.5 

STDEV[3] 3.0277 


Problems from 
Careless Use of 
Global Variable 


A Student's t Test 
Routine 
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TTESTL1] 3.0277 
TTESTL2). 9 
PPEGT USI S45 


TTEST([ 4] 
STDEVL1] 
STDEV[ 2] 
STDEVI 3) 
TTEST[ 5] 
TTESTL6 1] 


TTESTL7] 


TTESTL 8 


TTESTT 9] 


TTEST(.10] 
TTEST( 11] 


250 

30 

1000 

15.811 

15.811 

4 

30 

250 

9.5588 
3.0917 
0.10889 


TTESTL12] 0.33667 


TTEST[13] 72.771 


IO TTS: 


Compare line [3] in STDEV with line [13] in TTEST. We have different 
inputs (from STDEV) but the same output from TTEST. If we look at 
statement [9] in the display of the function, we see that the results of lines 
(10], [11], [12], and [13] are the same, even though we have different 
values for the standard deviation. First we notice that N1 and N2 are 
not defined in the program itself. Whatever values variables N1 and N2 
happen to have will be used in the program. We need statements like N1 <— 
p X and N2 <— p Y. But is this all? No. We also notice that the dummy 
variable in the header of the function STDEV is used as if it were a global 
variable! We should have received an error message when we tried to 
execute TTEST. The reason we did not is that a global variable S1 was 
defined earlier and is still floating around waiting to trap the unwary. 

Here we have two more reasons why great care is needed in deciding 
which variables should be dummies in the header, which local, and which 
global. It is also a warning to keep track of the globally defined variables 
and to erase those no longer needed. 

The rewritten program is listed below. Note in particular lines [1], [3], 
[5S], and [7]. 


VY TTESTLOI] V 
VIT+TTEST 
[1] WN1« px 
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NO+ (9 X)-1 
[3] SS+ €( STDEV X)x*2 


[4] X1+XBAR 
[5] Ne#o Y 
[6] Wr+ (p Y)-1 


7] SSS+ (STDEV Y)x2 
[8] X2<XBAR 


[9] A+ (€ NOxSS)+ (© WPxSSS))+N1+N2-2 
[10] A+Ax0.5 

[11] B<« ((44N1) +€ 14N2))+0.5 

[12] C«AxB 


[13] 77+ ( X1-X2)sC 
Vv 
Trace Routine Running the routine with the trace operator and the test data yields 
TTEST 

TTESTL1] 10 

TTESTL2] 9 

STDEVL11] 5.5 

STDEVL.2] 82.5 

STDEVL 3} 3.0277 
TTEST(3] 9.1667 
PTESTLY] 5.5 
TTESTLS 5 
TTESTT6] 4 
STDEVL1} 30 
STDEVEL2] 1000 


STDEVL3] 15.811 
TTESTL7] 250 
TTESTL8] 30 


TTEST(9] 83.269 


TTESTL10] 9.1252 
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TTEST[11] 0.54772 
TTESTL12] 4.9981 
TTEST[13] ~4.9019 


“4.9019 
These results check with hand computation at every stage, so we can now 
have more confidence in the correctness of this routine. 


The new TTEST routine gives an answer of t = 1.8908 rather than? = 
9.1397 in the comparison of the treatment and control data arrays that 


were defined on page 131. 


Summary 
Display of functions and use of quad, 0, (uppershift 7): 
VY function name [0]V: displays the entire function 
V function name [3])]: displays line [3] 
Vv function name [o[]]v: displays header of function 
VY function name [03]: displays function from line [3] to end 
Vv function name [4] (new APL expression): changes line [4] from 
what it was to that specified 
v function name [0]: displays function and then enters next 
line number ready for an additional 
statement (previous last line plus one) 
A line may be inserted between two existing lines, say 2 and 3, ina 
function by using: 
[13][2.2] (APL expression to be inserted) 
Deleting a line, say number 4: 
£13] (44 <— you enter [4] at line [13] 
Hit Line Feed or ATTN key 
Hit Execute key 
If you use branches in your APL statements, either conditional or 
unconditional, use line labels, not numbers. 
Examples: 
Unconditional Branch Conditional Branch 
Use: >EXIT Use: >EXITxi1A > B 
Not: +10 Not: +10x14 > B 
Computer Computer Diagnostic Tools: 


Diagnostic Tools 


Trace: TA function name <— 1, l,..., 1, sets up a ‘‘trace opera- 
tion’ that will be activated every time the function is used. 1, <1, <1,.. 
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< 1, are line numbers. Upon function execution. trace will print each line 
number, 1, /,.. . , 1, being traced, followed by the final (i.e., left-most) 
computed result of the line, which is usually the last variable defined 
(assigned a value) in each line. 

Trace is removed by: 


TA function name <10 


Stop: S A function name <j, h,..., ], sets up a ‘‘stop opera- 
tion’’ that will be activated every time the function is used. |,;< <<... 
< 1, are line numbers. Upon function execution, stop will print the first line 
number listed, /,, before executing it; all prior lines will have been exe- 
cuted. The computer will not execute line /,. Line/, or any subsequent line 
can be executed by typing right arrow and line number /. For example: 


35 


While the function is stopped by the STOP command, arithmetic opera- 
tions can be used and functions called. 
Stop is removed by: 


SA function name <10 


Any function that cannot continue executing its statements, either because 
of a programming error or because of a STOP command, is said to be 


suspended, 
Any function that cannot be executed because it calls a suspended func- 


tion is said to be pendent. 

Example of the above and the use of )SZ (state indicator system 
command): Suppose that at line [3] function A uses function B, which is 
suspended at line [5]. 


\SL <-you type 
BIS ]x <function B is suspended at 5 
AL3] <function A is pendent at 3 


To clear both suspended and pendent functions, type 
> 


as many times as necessary to get a blank response from the command 
)SI. 


APL Practice 


1. Let’s investigate some of the uses of the functions you learned in this 
chapter. 


(a) 54+{k44+3 
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(b) Type 3x1] and execute. The computer responds U:. Type 6. What 
do you get? 


(c) Type X+{} and execute. Now type 1. What is in X? Type X = 1 
and X = ‘1. 


(d) X,Mand execute. 

(e) Examine the difference between 10 p ‘W’, ‘TAC’ and 
10p'W's'TAC!. 

(f) Let 7«'20' and type 7=20. 

(g) Compare p'' and ¢'A!. 


2. The following two functions carry out exactly the same operations. 


V A COM B V A COMP B 
[1] >Bxi1A > B [1] +4+2xxA-B 
[2] +SxiA < B [2] 'LESS* 
[3] "EQUAL! [3] +0 
C4] >0 [4] "EQUAL! 
[5] B: 'GREATER' [5] >+0 
[6] +0 [6] 'GREATER' 
[7] S: "SMALLER! [7] Vv 
[8] Vv 


(a) What must the arguments be—scalars, arrays, or characters? 
(b) What is the result of the functions? 


3. The following function is meant to calculate the simple correlation 
coefficient between two variables X and Y. 


V R«X SC Y;MX;MY;NUM;DENOM 
[1] MxX<( +/x )+ p X 
[2] MY<( +/Y )+ p Y 
C3} NUM<+/( X-XM)+( Y-MY) 
[4] DENOM<(( +/( X-XM)*2)xx/( Y-MY)*2)*.5 
[5] R«NUM+DENOM 
Vv 
The mathematical formula for the correlation coefficient is 
pe hi = =) 
Find the two mistakes in the APL function. 


4. The function BICO was constructed to calculate the coefficients of the 
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10. 


11. 


expansion (a + b)" for given n, i.e., the binomial coefficients. See if 
you are getting the desired result. If not, can you detect the error? 


V R«BICO N 
[i] Ret 
[2] R+( 0,R)+R,0 
[3] -W2 p R/2 V 


Write a function that will give you the first N elements of the 
Fibonacci series 1, 1, 2, 3, 5, 8, 13,... . (Each number in the series is 
equal to the sum of the two previous numbers.) 


My objective was to draw a pyramid. I tried: ' At(1+@20)°.<120] 
and I got a tilted pyramid. Can you help me? 


Write an APL expression that will replace with K all of the 
elements less than or equal to K in a given array W. 


The powerful jot dot operator and its uses are summarized in the 
following exercises. 


(a) Xo.xX«120 (g) (X+1)°.@X (m) Xo. LL/X 
(b) Xo.=X (h) Xo.*X (n) Xo.#X 
(c) Xo. > X (i) Xo. *0 (0) Xe. <10 
(d) Xo.2>X (j) 0°. *X (p) Xe.245 
(e) Xo. < X (k) 1°.*X (q) Xe.130 
(f) Xo.<X (I) Xo. P ) /X (r) Xo. «2 


Show that the expression +/(14)°.*2 is equivalent to (1 + N) x (1 + 
2 xX N) X N + 6 for any integer N. 


The generalized inner product has more uses than we showed in the 
text. Consider 4+3 4 5  Be15 


(a) Ax.-B (g) A!.L /A 
(b) Ax.-2 (h) A+. x2 
(c) 2x.-A (i) Ax.+2 
(d) B.-A (j) At+.@A 
(e) Bf. LB (k) 4@.%*A 
(f) At. L /A 


Since there are twenty-four primitive (scalar) dyadic functions, there 
are 406(2! 24) possible combinations. Good luck. 


Enter the function TTEST (pages 135-136) on your workspace and 
carry out the following tasks: 

(a) Display the function but don’t close it. 

(b) Between lines [12] and [13] put the line 


'THE T-STATISTIC IS! 
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(c) Rename the function as 
X TTEST Y 


(d) Put the appropriate local variables in the function header. 


Statistical Applications 


1. The nicotine content in milligrams of a random sample of two kinds of 
cigarettes is given below. 
Brand A: 16.2 17.7 16.7 15.9 15.1 
Brand B: 14.8 17.5 16.1 13.3 15.6 
Use your TTEST function to find out if the two brands have on the 
average the same nicotine content. (These are not paired observations.) 


2. Ten persons engaged in a prescribed program of physical exercise for 
weight reduction for a period of one month. The results are given in the 
following table. 


Weight (Ib) 
Before the program: 208 215 196 185 232 156 188 195 232 167 
After the program: 205-219) 185) 175) 207) «132s 195): 158 )=— 198 ~~ s:167 


Was the program effective? The program is effective if the mean 
weight after the program is statistically smaller than the mean weight 
before the program. (Hint: Let Y = after — before.) 


3. In 1960, a sample of 300 ten-year-old boys had mean height 52.8 
inches, with standard deviation 2.3 inches. In 1975, a sample of 600 
ten-year old boys had mean height of 53.9 inches, with standard devia- 
tion 2.5 inches. Since the samples were very large, you may take 
ai and a3 as being known: a? = (2.3)? = 5.29; of = (2.5)? = 6.25 so 
that X¥, — X, has a variance of (5.29/300) + (6.25/600) = 0.28. Can 
we conclude that on the average the ten-year-old boys of the seventies 
are taller than the ten-year-old boys of the sixties? 


4. Here is a group of problems that ask you to check if the Poisson 
distribution can be used to fit some data. We remind you that the 
Poisson distribution is given by P(X) = M*e“/X!, X = 0, 1,2,... 
and that 

(i) EX) = M 
(ii) VAR (X) = M 
(iii) Sy = VM 
(iv) as = 1/\/M, coefficient of skewness 
(v) a, = 3 + 1/M, coefficient of kurtosis. 
Notice that since a; > 0, the Poisson distribution is skewed to the 
right. 
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Combine the POISSON function (page 75), the MNTS function 
(page 76), the HIST function (page 77), the Poisson tables of exercise 
9, Chapter 5, and the coefficients of skewness and kurtosis into a 
POISSON FIT function whose output will consist of 


(i) The first four sample moments. 
(ii) The sample histogram. 
(iii) The theoretical values of X. 


(iv) The probabilities of X being greater or less than a specified value 
given in the function header. 


Use this function to solve the following problems: 

(a) A study was made to determine whether the deaths of centenarians 
were distribuied randomly over time. In the data below, X repre- 
sents the number of such deaths that occurred in any one day, andf 
represents the number of such days in a set of one thousand days. 
Xx 0 1 2 3 4 5 6 7 8 9 
f 229 325 257 119 50 17 2 1 0 0 
Fit the Poisson distribution to these data. Find the probability that 
at most one death occurs on a given day. 

(b) In the following table, X represents the number of shirts bought by 

Ff men that walked into a clothing store in one day. 

Xx 0 1 2 3 4 

f 35 40 16 8 1 

Use the POISSON FIT function to check if the Poisson distribution 


fits the data satisfactorily. Find the probability that a randomly 
selected customer will buy at least one shirt. 


(c) The following table gives the number of times (X) that your APL 
terminal malfunctioned, and the respective number of days (f) 
that a malfunction occurred during a one year period. 

XxX 0 I 2 3 4 5 


f 220 80 50 8 2 0 
Fit the Poisson distribution to these data and find the probability 


that during a specific day you will have no problem with your 
terminal. 


5. The owner of a bakery knows that the daily demand for a highly- 
perishable cheesecake is as shown in the following table. 


Daily Demand Probability 
0 0.05 
10 0.15 
20 0.25 
30 0.30 
40 0.15 
50 0.10 


1.00 
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Since the most likely daily demand is 30 cakes, he decides to cook 30 
cakes per day. Assume that for each cheesecake he cooks and sells he 
makes a profit of $3, while for each one he cooks but does not sell he 
loses $2 (assuming that he must throw it away). (Hint: For every 
number he could bake between 1 and 50, use the computer to compute 
the expected profit.) 

(a) Find his expected profit if he decides to cook 30 cakes. (Hint: If he 
cooks 30 and sells 0 he loses $60, and this happens with probability 
0.05, and so on.) 

(b) Find the optimum number of cakes that he can bake (i.e., the 
number that will maximize his expected profit). Notice that the 
profit-maximizing daily demand does not equal the maximum 
daily demand. 


6. A process for making steel pipe is under control if the diameter of the 
pipe is 5 inches. The known value of the standard deviation is 0.015 
inches. In order to test whether the process ts under control, a random 
sample of size 30 is taken, and the mean value of the sample is found to 
be 3.0078. If a level of significance of a = 0.01 is used, should the 
process be adjusted or not? 
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Elementary Linear Regression, 
Goodness of Fit, and 


Analysis of Variance 
(ANOVA) Problems 


9.1 Introduction to Linear Regression 


Simple Least 
Squares Regression 


Estimators of 
Regression 
Coefficient 


Estimators of the 
Variance of 
Regression 
Estimators 
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Let us begin with the simplest form of a linear regression model—one 
found and discussed in every introductory textbook on statistics or 
econometrics; some useful textbooks are listed in the bibliography at the 
end of the book. The mathematical model is 


Y=at+bx+U 


where Y is the regressand, X is the regressor, and U is called a disturbance 
term. Only Y and X are observed; U is unobserved. All textbooks assume 
that you have n observations, or, in our new APZ language, we would say 
we have two arrays of length n called Y and X. 

The idea of regression analysis is to find values fora and b, say 4, 6, such 
that we minimize the sum of squared deviations (differences) between the 
elements of the array Y and those of the array Y defined by Y= a + bX. 
Mathematically, this means that 6, @ are given by 


b = (xX; — X) (¥, — ¥)/S(X, -— X)? 
= (= X,Y; — nXY)/(= X? — nX?) 


X = 2 X,/n, Y= LY,/n, nXY = DY,X 
G=Y~bxX 
Correspondingly, the estimated variance of U and the variances of the 
coefficient estimators a, b are defined mathematically by 
s? = VAR (U) = S(¥; — ¥)*(n — 2) 
Pi a - 
VAR (0) = s(x — X)?)™ 
VAR (4) = s*(n7! + ¥?/3(X, — X)?) 
a a - = 
VAR (4,6) = — Xs?/3(X, — X)? 


Breaking up the 
Sum of Squares 
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An important identity in regression analysis is that the total sum of 
squares is equal to the sum of the regression and error sums of squares. 
Algebraically, this is 

SY, = Y= 27, = FP + =(Y, = YF 
SST SSR SSE 
Here SST is the total sum of squares, SSR is the regression sum of squares, 
and SSE is the error sum of squares. 
The coefficient of determination is given by the formula. 


R? = SSR/SST = 1.0 — SSE/SST 


9.2 An APL Program for Linear Regression Analysis 


Once you have a clear and accurate mathematical statement of what you 
want, you can write down the appropriate APL expression. Let us define a 
little APL regression function to calculate the above statistics for any pair 
of arrays Y and X. 

We will define a regression routine which does not return an explicit 
result, which has two arguments, and we will make sure that all other 
variables used in the routine are local. Now that we know what to do, we 
can easily write down (on paper) each line in sequence. As we add vari- 
ables, which we want to be local, in the body of the routine, we can add the 
variable names to the header. When the routine is written out on a piece of 
paper, we can check it over for errors and then enter it on the terminal. 

The header will start as 


V Y REGRESS X 


An examination of the series of mathematical formulas listed above 
shows that the calculation of 6 is central to the whole analysis, so let’s 
begin by defining B, the APL symbol representing the estimator 6. The 
following series of lines will give you one idea of how to go about writing a 
‘‘one-liner’’. Start off with the basic mathematical notion and work out- 
wards from there, keeping in mind that the computer reads from right to 
left. But do not expect to get everything not only right, but elegant or even 
computationally efficient, the first time. Let’s define 8, the APL variable 
for b. Each line of the text below represents a subsequent stage in fitting 
together the whole line. 


XM is going to be the 
mean of X and ts useful 
to define equations 
needed later 

N is going to be the 
length of X and will be 
used a lot below 


[1] B«<« (Y+ .x X) -+/¥x XM 


[1] B« ( Yt .xX)-( +/Y)xxMt + / 
( X-XM< ( + /X)#N)*2 
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XSQis the sum of 
[1] BCC ¥Y + .xX)-(C +/¥)xXM)+ squared deviations of 
(xXSQ « + / ( X-XMe ( +/X)+Nep X)*2) | X from XM, which will 
also be used repeatedly 


Reread line [1] from right to left, ensuring that it does what is intended and 
that the parentheses match. 


[2] A+ + /Y:N-B xXM 


+/Y+N is inefficient; Y:W-Bx XM 
does not work as intended 


[2] Ae (C 4/Y) EN) Beem { aris is nearly right, but we will 


need the mean of Y later 
[2] A« ( YM<« ( +/Y)#N)-BxXM 


Check this line, which now seems reasonable, and let’s proceed to the 
next. The remaining lines will give you no problems. In the preface we 
computed these coefficients using YX and we could have used that method 
here too. X would be an N-by-2 array where the first column is composed 
of 1s. This method is presented in Chapter 13. 

The whole program might look like this: 


)CLEAR 
Regression Routine CLEAR WS 

VY Y REGRESS X 
C1] BeCC Y + .xX)-€C +/Y)xXM)= (XSQ* +/ (X-XM<( +/X)tNep X)*2) 
[2] A« ( ¥YMe ( +/Y)#N)- BxXM 
[3] SST<« ( +/ CY-YM)*2) 
[4] SSE<+ / ( Y- ( At+BxX))*2 
[S] V«SSE+ ( N-2) 
[6] VB«VixXSQ 
[7] VA*« (CC XM*2)#XSQ)+3N)xV 
[8] COV+-XMxV=XSQ 
[9] ASQ+1-SSEtSST 


[10] 'SAMPLE SIZE IS SN 
[11] 'MEAN OF X IS "3 XM 
[12] 'MEAN OF Y IS '; YM 
[13] "VARIANCE OF X IS "3 XSQtN 
[14] 'VARTANCE OF Y IS 'SSSTtN 


[15] "VARIANCE OF ERROR IS SV 


Sample Results of 
Regression Routine 
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[16] 'COEFFICIENT OF A = '3A 
[17] ‘VARIANCE OF A = 'SVA 
[18] '7-RATIO = 'SA=VA*0.5 
[19] 'CORFFICIENT OF B = 'sB 
[20] 'VARIANCE OF B = ';VB 
[21] 'T-RATIO = 'sBtVBx0.5 


[22] ‘COEFFICIENT OF DETERMINATION IS 'SRSQ 
[23] "COVARIANCE ( A, B ) TS ';COV 
V 
¥<«55 70 90 100 90 105 80 110 125 115 130 130 


X*«100 90 80 70 70 70 70 65 60 60 55 50 


Y REGRESS X 
SAMPLE SIZE IS 12 
MEAN OF X IS 70 
MEAN OF Y IS 100 
VARIANCE OF X IS 187.5 
VARIANCE OF Y IS 525 
VARIANCE OF ERROR IS 69.889 
COEFFICIENT OF A = 210.44 
VARIANCE OF A = 158.03 
T-RATIO = 16.74 
COEFFICIENT OF B = “1.5778 
VARIANCE OF B = 0.031062 
T-RATIO = “8.95 


COEFFICIENT OF DETERMINATION IS 0.88907 
COVARIANCE ( A, B ) IS “2.1743 


Now you know how to calculate‘ various statistics used in simple linear 


regression. . . . 
We can define Y = @ + 6X, the residuals V = Y — Y, plot relationships 


between them, calculate the variance of the estimator Y, and so on. Just for 
practice, try the routine RESID, which will calculate the residuals and Y 
ready for plotting or otherwise analyzing the results. Consider 


VRESIDLUIV 
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Residual Routine V RESID 
[1] YH<A+Bxx 
[2] RES<Y-YH 
[3] 'RESIDUALS'* 
C4] RES 
(5] "COMPUTED VALUES OF Y>2>>>+Y-HAT! 
[6] YA 
V 
RESID 
RESIDUALS 
2.333333333 1.555555556 5.777777778 0 10 5 20 2.114111111 
9.222222222 ~0.7777777778 6. 333333333 ~1.555555556 
COMPUTED VALUES OF Y>*>>>>Y-HAT 
52.66666667 68.4444444un 84,22222222 100 100 100 100 107.8888889 
115.7777778 115.7777778 123.6666667 131.5555556 


9.3. Goodness of Fit, Contingency Tables, and ANOVA Problems 


Chi-Square Distribution and Goodness of Fit 


Chi-Square and A number of statistical analyses of data are linked by their common 

Goodness of Fit reliance on the chi-square distribution. The chi-square distribution is the 
distribution of the random variable Y, defined by Y = {YU}, where the U, 
are independent random variables, each distributed as a standardized nor- 
mal. That is, it has a mean of zero and a variance of one. Stated symboli- 
cally, U; is distributed as N(O, 1). The chi-square distribution depends only 
upon its degrees of freedom; in the above example, the distribution of Y is 
chi-square with N degrees of freedom. 

One of the first uses of the chi-square distribution you will encounter in 
your Statistical travels is the ‘‘goodness of fit’’ test. In the goodness of fit 
test you are asked to decide whether an observed set of relative frequen- 
cies is consistent with some theoretical predictions. So suppose that you 
have the traditional k cells; there are a total of N observations, n, of which 
are in the first cell, 1, in the second, and so on. 2, + m2 +++: tn, = N. 
You assume that someone (maybe your favorite uncle) has given you the 
theoretical probabilities P; of getting an observation in each of the cells. 
Thus P, is the probability for the first cell, P, is the probability for the 
second, and so on, so that the expected number of entries in each cell is 
NP,,NP2,. . . ,NP,. Your task is to compare the n; with the NP;,i = 1, 


Chi-Square 
Distribution 


Goodness of 
Fit Routine 


Contingency 
Tables 
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2,...,, and the statistic you calculate is written mathematically as 


Ww = (4 NE) 


where W is distributed as a chi-square variable with (k — 1) degrees of 
freedom. 

The APL expression is obtained in a straightforward manner. Our inputs 
to our chi-square function are two arrays, one for the n; and one for the P,, 
i= 1,2,...,4. So let’s begin by entering them: 


N«i5 74 11 6 17 
PI<6 op +6 


We have here an experiment to test whether a die is unbiased. On the 
assumption that the die is unbiased, P; = 1/6,i = 1, 2,...,k. This is a very 
simple function, so we can build up our expression from the middle (rather, 
from the most important part of the expression) as follows. Using paper 
and pencil, we put down 


First Stage: W-PIxWM<+/N 
Second Stage: ((W-PIxNN«+/N)*2)+NNxPI 


This will not do, since NN (reading from the right) has not been defined 
early enough. 


Third Stage: +/((N-PIxNN)*2) + PIxNN+ +/N 


A Suitable goodness of fit routine would look something like the follow- 
ing: 


V N GOODFIT PI 

C1] Ge+/ ((N-PIxNN)*2)3PIxNN<+/N 

[2] 'CHI-SQ GOODNESS OF FIT STATISTIC IS VSG. 

[3] 'WITH '; (pN)-1; ' DEGREES OF FREEDOM! 

[4] Vv 

Let’s try our function. Enter it into the computer, and then call it by 

N GOODFIT PI 

CHI-SQ GOODNESS OF FIT STATISTIC IS 13.6 

WITH 5 DEGREES OF FREEDOM 


Contingency Tables 


Now we can move on to something a little more ambitious that intro- 
duces some new APL concepts. Contingency tables are generated in the 
following manner. Suppose that someone takes a sample of people, or 
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machines, or whatever, and then classifies everyone or everything into two 
categories. For example, people can be classified according to blood type 
and color of eyes, machines by frequency of breakdown (i.e., once a 
month, twice, three or more times) and number of defective items pro- 
duced, and so on. The two-way classification produces a table of cells such 
as that shown in Table 9.1. 


Table 9.1. Two-Way Classification Scheme 


Color of Eyes 
Bl Br Gr G 


Blood 
Type 


Om > 


In this example there are 12 (3 x 4) cells altogether. The entry in each 
cell is the number of people in the sample who have the designated pair of 
classification characteristics. For example, the entry in the B row and Br 
column would be the number of people in the sample with blood type B 
and eye color brown. Everyone in the sample has to be in one (and only 
one) of the cells, so if we add up all the cell entries we get the number of 
people in the sample.* Let n;,,i= 1,2,...,r,j=1,2,... ,c denote 
the number of people in the cell (/,/), where r andc are the number of rows 
and columns, respectively, in the table. The hypothesis to be tested with a 
contingency table is that the observed entries in any row (or column) differ 
from the expected number in the row (or column) only by sampling varia- 
tion. The expected number in the cell (i,/) is given by (n;.n,;)/N, where nj, 
is the ith row sum and 7, is the jth column sum, and N is the total number 
of people in the sample (see Table 9.2). 


Table 9.2. Expected Number of Observations in the Two-Way Classification 
Scheme of Table 9.1. 


Color of Eyes 


The chi-square statistic for this problem is 


(ny — njnj/ NP? 
= r ¢ pS aL LB aa Co 
We Dir dior njnj/N 
which, if the assumption is true, is distributed as a chi-square distribution 
with (r — 1) x (c — 1) degrees of freedom. 


* Our appologies to those with blood group AB or hazel eyes. We simply wanted to keep the size 
of the table down. 


Reshape p 


Plus Reduction of 
Rows and Columns 
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Dyadic p: Reshape 


Our first problem in handling this situation is how to enter a table like the 
one above into the computer. This reintroduces the dyadic use of », called 
the ‘‘reshape’’ function. Consider the following examples: 


As2 39 1 2 
A 


B<2 291 234 


B 

de 

37 
C#2 2p 12345 6 
C 

1 2 

3.4 


The use of reshape rearranges the array given on the right-hand side into 
the shape denoted by the left-hand side. If D is an array, then F<¥ WV p D 
will take the first n elements of D and put them down as the first row of E, 
then the second n elements of D become the second row of E, and so on, m 
times. If there are too few elements in D, as shown in the example with 
A, then the elements of D are reused from the beginning until a table of 
appropriate size is created. In APL such tables are known as two-dimen- 
sional arrays or tables, as opposed to the one-dimensional arrays or lists 
we have been having fun with so far. 

Our first task is to obtain the row and column sums. With a one-dimen- 
sional array we know how to do this: +/(plus reduction). But with rows and 
columns, what happens? Try 


+/A 
45 


So +/ on a two-dimensional array gives us plus reduction of the rows of A. 
We get plus reduction of columns by a simple device. Try overstriking the 
reduction sign with the subtract sign; thus: 


+7A 
3.3 3 


gives us plus reduction down columns. 
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Outer Product 


Now that we see how to get row and column sums, the next step is to 
create the table of entries: (n,.n,)/N. This is most easily accomplished by. 
means of what in APL is called the outer product. Suppose that x and y are 
any two single-dimensional arrays, say m andn in dimension, respectively, 
and we want the table created by multiplying each element of x by each 
element ofy. Ifx has elements x,,%2,... ,X,, andy haselements y,,y.,..., 
Yn, then we want the (m x n) table of entries given by x,y,, X;V2, X1Y3,- + = 5 
XmYn alranged into an (m X n) table. This sounds like a very complicated 
arrangement, but once again what looks difficult is easy in APL. The (m X n) 
table we want is obtained by using the ‘‘outer product,” or jot dot product, 
which is o.x and is keyed by upper case -/ (gives the jot), period (gives the 
dot), and the multiply key. Thus we have 


T<Xo.xY 


and T is the desired (m X n) table of entries. Let’s try it. Recall the X¥ and Y 
that we have used before. In case you logged-off since last using them, we 
have 


X12 °3°4U5 76 7 
Ye? U6 84 26 
T+Xo .x¥ 


10 20 30 40 20 10 30 
“12 “24 “36 “48 “24 “12 ~36 
“14 “28 “42 “56 28 14 42 
If you examine the entries of T you will see that the entry in the (i, /) 
position is the product of the ith element of X and the jth element of Y. 
Now we can create a table of entries where the (i, /) entry in the table is 


(n;n;)/N. Let TB be the two-dimensional array of numbers n,;. Write down 
our first attempt at creating the table: 


R+«+/TB 
C<+4TB 
TB<TB-Ro .xCtN<++/R 


Line [1] gives the row sums and line [2] the column sums. The third line 
obtains N by adding up row sums, and divides that into the array of column 


Chi-Square 
Contingency Table 
Routine 
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totals which, in turn, is multiplied element by element by the elements of 
the array R. The new array is then subtracted from the original array 7B, 
element by element, and the result is stored in JB again to save space (the 
storing of a number of big tables soon uses up all of your available work- 
space). The new contents of 7B contain elements such as (mj—7,n,;/N). 

The elements of the new array 7B have to be squared and divided by the 
elements of (Ro.x(+N++/R). Let’s try again. 


NT+Ro .xC+N++/R 
+/+/((TB<TB-NT)*2):NT 


To be sure that you are following all this step by step, let’s actually per- 
form these operations with 7B. 


TB+2 302 31142 


R++/TB 
R 
6 7 
C<+7TB 
C 
3-23 
NT<Ro.xCtN« +/R 
NT 
1.384615358 3.230769231 1.384615385 
1.615384615 3.769230769 1.615384615 


A routine for calculating the statistic of a contingency table might look 
like the following: 


V CONTAB TB;R;C3;NT3W 
[1] R++/TB 
C2] C<+/TB 
[3] DF+(pTB)-1 
C4] WT+Ro.xC+N++/R 
CS] Wet/+/(( TB-NT)*2)=NT 
C6] 'THE CHI-SQ STATISTIC FOR A' 
[7] ‘CONTINGENCY TABLE IS '3;W 
[8] 'WITH ':DF[1]xDF[2];' DEGREES OF FREEDOM! 
[91 Vv 
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This routine does not produce a specific result, and variables used as 
intermediate output are made local to the function. Note that this saves 
room in the limited workspace available to you. Otherwise, the intermedi- 
ate products would sit around until you reused them or erased them. Let’s 
try it. Type in 


TB<3 4 918 29 70 115 17 28 30 41 11 10 11 20 
CONTAB TB 

THE CHI-SQ STATISTIC FOR A 

CONTINGENCY TABLE IS 19.94264769 

WITH 6 DEGREES OF FREEDOM 


One-way Analysis of Variance 


So much for testing hypotheses about contingency tables. The last type 
of statistical analysis we will handle in this chapter is analysis of variance 
(abbreviated ANOVA). Only the simplest types of problems will be dealt 
with here. The analysis that follows is called one-way analysis of variance. 

Once again imagine that we have a table of entries. Each column of 
entries might represent, for example, crop yields with a given type of 
fertilizer on different types of land, or each column might represent typing 
speeds of various typists on a variety of machines. 

If x, represents the entry in the (i, /)th position, for example, the ith 
typist using the jth machine, then x, X2,... , %, represent the mean 
typing speeds of all typists using the first machine, the second machine, 
and so on up to the cth machine—there being c machines and c columns. 
Let + represent the overall mean, i.e., the mean of the column means. 

The main concept in analysis of variance is to examine the breakup of the 
total sum of squares just as we did in the regression section. Thus we 
consider the identity 


Day hha (Ky — xP = Def (xy _ x3)? + Dia Dhar (X — x)? 
TSS = ESS + CMSS 


where TSS = total sum of squares, ESS = error sum of squares, and 
CMSS = column mean sum of squares. CMSS can be rewritten as 
Dia Dh (%; — XP = rBsiy(v; — x). 

The test of the hypothesis that there is no difference in the column means 
is obtained by looking at the ratio of CMSS/(c — 1) to ESS/(re — c) which, 
under the null hypothesis of no difference, is distributed as F with (c — 1) 
and c(r — 1) degrees of freedom. This is fully explained in statistics books; 
see, for example, Mendenhall and Reimuth, which is listed in the bibliog- 
raphy. Consequently, we want to calculate the statistic F defined by 


Fz CMSS[c(r — I] _ rer -— 1) Shy; - x) 
~ ESS(c _ 1) 7 (c _ 1) DyaylXy _ x)? 


Index of an Array 
of Dimensions 


One-way ANOVA 
ANOVA! 
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Let us suppose we have a table X of c columns ofr entries each, and we 
want to do a one-way analysis of variance on X by columns. From the 
mathematical expression above we need column sums and the overall sum; 
but this is now very easy! Write down the following first attempt: 


MCOL+( +#X)+R«(p X)T1] 

Me( +/MCOL)+C+( 9 X)E2] 

NUM*RxCx( R-1)x+/( MCOL-M)*2 

DEW+( C-1)x+/+/ ( X- (B,C) pMCOL)*2 
F<NUM+: DEN 


The only new element here is (pX)[1] and (oX)([2]. The expansion 
(pX) is an array of the dimensions of X, so that (pX) [2] gives the second. 
If the parentheses had been left off, any attempt to execute pX¥[1] would 
have given an error; thus 


o X [1] 
RANK ERROR 
1 
p xt J 


The reason for this is that once X has shape r X c, the expression X[1]is 
invalid. What is needed to index a two-dimensional array is a pair of 
indexing numbers, e.g., X[1;2]. More of this later. What we wanted, of 
course, was the first element of the one-dimensional array (pX) . 

You should also recall that to obtain an (R x C) table from MCOL, an 
array of C elements, we must write (2,C) oMCOL and not just RéC pMCOL, 
which produces a SYNTAX ERROR .Thisis because the pair of variables? Cis 
not an array even though 8 3 would be. To produce an array with a set of 
variables it is necessary to catenate them; thus, (?,C) is an array of two 
elements, 7 and c. 

An example of an analysis of variance routine for this simple problem is 


Vv ANOVA1 X 
[1] MCOL+(+4X)+R+(o X)[1] 
[2] Me(+/MCOL)2C+(o X)[2] 
[3] NUM«RxCx (R-1)x+/ (MCOL-M)*2 
[4] DEN (C-1)x+/+/(X-(R,C) p MCOL)*2 
[5] P<NUMsDRN 
[6] ‘THE F STATISTIC FOR ANALYSIS OF' 


[7] 'VARTANCE ACROSS COLUMNS IS ':3F 
[8] "WITH '; (C-1), 'AND ';Cx(R-1);' DEG FREEDOM ' 
[9] Vv 
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Now that we have it, let’s try it. Type 
TAB+8 30 44 4O 54 39 37 50 33 28 40,0 


QO: 

56 53 55 43 38 45 56 51 66 47 45 49,0 
(]: 

58 60 65 

ANOVA1 TAB 


THE F STATISTIC FOR ANALYSIS OF 
VARIANCE ACROSS COLUMNS IS 1.868644068 
WITH 2 AND 21 DEG FREEDOM 


Two-way Analysis of Variance 


The last problem that we will consider in this section is two-way analysis 
of variance. As before, we have a table of entries with r rows and c 
columns. With one-way analysis of variance we calculated only column 
means; in two-way analysis we calculate both column and row means. As 
before, the analysis of the test of the null hypothesis of ‘‘no effects’’ is 
based on the breakup of the total sum of squares. Consider the identity: 

Liar Dfar(xy — X)? = Var Bfar(xy — xy — Xi + X)? + Lhe Dfe(Ki, — x)? 
TSS = ESS + RMSS 


+ Bh Bfale 5 — ¥)? 
+ CMSS 


x is the overall mean, x; is the mean of the ith row, and x, is the mean of 
the jth column. RMSS is the row mean sum of squares and CMSS is the 
column mean sum of squares. CMSS can be rewritten (as before) as 
r2f21(%5 — x)? and RMSS as cD?_,(x%;, — X)?. 

With these statistics we can test three different hypotheses: 

H,: Row means are equal, column means are unspecified. 

H,: Column means are equal, row means are unspecified. 

H,: Row and column means are both equal (not necessarily to each 

other). 


Hypothesis H; is tested by the statistic F; defined mathematically by: 


Fr = —RMSS/(r = 1) 
‘" ESS/[(r — De -— DJ 

F, = —CMSS/(c = 1) 
>” ESS/[(r — Ic - 1] 


ANOVA2 
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p, = QRMSS + CMSS)/[(r = 1) + (¢ = 1) 
a ESS/[(r — De — 1] 


Under the null hypothesis of no row or column effects, F , is distributed as 
F with (r — 1) and (r — 1)(c — 1) degrees of freedom, F,, as F with (c — 1) 
and (r — 1)(c — 1) degrees of freedom, and F, as F with (r — 1) + (c — 1) 
and (r — 1)(c — 1) degrees of freedom. 

All we have to do now is to program a routine for calculating the statis- 
tics required for a two-way analysis of variance. Let’s modify ANOVAI 
and call it ANOVA2. 

So far we have not paid much attention to trying to make our routines 
computationally efficient or compact in terms of size of workspace used. 
This is because we felt you had enough to do in learning statistics and the 
basics of APL all at once. But, since ANOVA? will be similar to ANOVAI, 
let’s take the opportunity to do things a little more efficiently than before. 
To this end, let’s expand and simplify algebraically each of the sums of 
squares listed above. 

ESS: 7-,2failx3, + 3? + 32 + 2 + 2-H, -— €5+ D 
—2%i(—-X 5 + X) — 2X 5K] 


= Dh lh yxy — cl XP — rd x2, + rex? 
RMSS: 27, 2f.[ x? + x? — 2x; x] 
= cD ,X? — rex = c[D x? — rx*] 
CMSS: 2H 2fe 1, + 2? — 2% x] 
= 73,2, — reX = [Dp - c#] 
From the above expressions we see that the major sums needed are 22x}, 
2x}, 2x’, and x?. We might begin by putting together the various sums and 


parameters required to calculate the main results. Consider, for a table X 
of entries 


R«(p X)C1) eee us the R, C values 


C+(p X)[2] needed in the calculations 
XR++/X Gives the row and column 
XC<+4X sums 


As noted above, there are three possible F values we can calculate, all of 
which require ESS/[(r — 1)(c — 1)] in the denominator. Consider the 
following possible form for ANOVAZ2: 

V ANOVA2 X;RM;CM 
[1] R+(o X)C4] 
[2] c<(p x)[2] 


ANOVA2 
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r, = (RMSS + CMSS)/[(r = 1) + (¢ = 0] 
a ESS/[(r ~ I)(c — 1)] 


Under the null hypothesis of no row or column effects, F’, is distributed as 
F with (r — 1) and (r — 1)(c — 1) degrees of freedom, F,, as F with (c — 1) 
and (r — 1)(c — 1) degrees of freedom, and F, as F with (r — 1) + (c — 1) 
and (r — 1)(c — 1) degrees of freedom. 

All we have to do now is to program a routine for calculating the statis- 
tics required for a two-way analysis of variance. Let’s modify ANOVA1 
and call it ANOVA2. 

So far we have not paid much attention to trying to make our routines 
computationally efficient or compact in terms of size of workspace used. 
This is because we felt you had enough to do in learning statistics and the 
basics of APL all at once. But, since ANOVA? will be similar to ANOVA], 
let’s take the opportunity to do things a little more efficiently than before. 
To this end, let’s expand and simplify algebraically each of the sums of 
squares listed above. 


ESS: Dr, Dfaifx3, + x? + x2, + 2 + 2%,(-X,, — ¥; + ZX) 
—2x;(—xX; + X) — 2x 5x] 


= They x}, — cD x? — rd x, + rex 
RMSS: 27_,2f.,[x?, + x? — 2x; x] 
= ch ,x7 — rex? = cl 2X? ~ rx?) 
CMSS: 27.2% 1%, + 3 — 2x x] 
= D522, — rex? = r[ Dye, -— cx?) 
From the above expressions we see that the major sums needed are 22x?, 
2» x?, x2, and x?. We might begin by putting together the various sums and 


parameters required to calculate the main results. Consider, for a table XY 
of entries 


RP<e(o X)C1] { Gives us the R, C values 


C+(p X)[2] needed in the calculations 
XR++/X Gives the row and column 
XC#+7#X sums 


As noted above, there are three possible F values we can calculate, all of 
which require ESS/[(r — 1)(c — 1)] in the denominator. Consider the 
following possible form for ANOVA2: 

V ANOVA2 X;RM;CM 
[1] ReCp X)C1] 
[2] C«(p X)[2] 
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[3] RM +/X):C 

[4] GMC +/ ( CMe ( +/X)4R))4C 

[5] RMSS+Cx( (RRMe+/( RM*2))-RxGM*2) 

[6] CMSS<Rx( CCM<+/CMk2)-CxGMx2 

L7]) BSS*«(+/+/X*2)+(( -CxRRM)+( ~RxCCM) )+CxRxGMx«2 
[8] DEN«ESS:DDF+(( R-1)x( C-1)) 

[9] PF1<RMSS+DENx( R-1) 

[10] F2<CMSS:DENx( C-1) 

[11] F3<(€ RMSS+CMSS)+DENx( R+C-2) 

[12] 'THE F STATISTICS FOR TWO-WAY ANOVA ARE: ! 
[13] 'TESTING ROW MEANS, F IS ';F1 

[14] "WITH '; ( R-1),DDF;' DEG. FREEDOM! 

[15] 'TESTING COL. MEANS, F IS '3F2 

Li6] "WITH '; (C-1),DDF;' DEG. FREEDOM? 

[17] 'TESTING ROW AND COL. MEANS, F TS ':P3 
[i8] ‘WITH '; ( R+C-2),DDFP;' DEG, FREEDOM' 

[i9] Vv 


There are a few points in this routine worth noting. First, the number 
of operations performed has been reduced. This was made possible by 
showing that the required algebraic expressions depended upon a small 
number of partial sums. Second, intermediate results are stored in a 
variable for use later in the routine (see, for example, lines [4], [5], and [6]). 
Third, an attempt was made to ensure that where a sum of variables is 
divided (or multiplied) by the same constant, the sum is taken first, then 
the result is divided (or multiplied). In general, it is better to multiply 
before dividing in a complicated expression since this reduces the propaga- 
tion of errors introduced by the process of division. This rule was not 
followed entirely in this routine since it was, in fact, more efficient to form 
the denominator first, because it was used repeatedly. 

Line [7] is interesting in that it poses a trap for those who forget 
that the computer operates from right to left. If we had written 
BSS«+ /+/X*2-CxRRM-RxCCM+CxRxGM, we would have obtained an errone- 
ous result. Cx#GM” is alright, but then this is added to CCM, which sum is 
then multiplied by R, leading to the first error. The result (of 2xCCMxCxRxGM) 
is subtracted from RRM, leading to the second error, and so on. Note that 
the minus operator signs used with R and C are being used in their monadic 
sense; —C and —R change the signs of C and R. 
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60 65 
ANOVA2 TAB 


ANOVA2 THE F STATISTICS FOR TWO-WAY ANOVA ARE: 


TESTING ROW MEANS, F IS 29.46666667 


WITH 7 14 DEG. FREEDOM 


TESTING COL. MEANS, F IS 19.6 


WITH 2 14 DEG. FREEDOM 


TESTING ROW AND COL. MEANS, F IS 27.27407407 
WITH 9 14 DEG. FREEDOM 


While this routine is neither as compact nor as elegant as it might be, it 
has served the purpose of demonstrating some of the factors you should be 
beginning to take into consideration. The major ones are: 


1. 


Keep the use of big arrays, especially tables, to a minimum—they use 
up an incredible amount of workspace. 

Simplify the algebra as much as possible before beginning to write the 
routine. 

Avoid repeating calculations. 


9.4 * Calculating the Chi-Square and F Distributions 


In this short section we will discuss how to calculate the chi-square and F 
probability distributions so that you need not use the tables and can pick 
confidence levels and sizes of tests not available there. More importantly, 
by plotting the distributions you can acquire a better understanding of 
them than would otherwise be the case. 


Let’s begin with the chi-square distribution. If Y is distributed as chi- 


square with N degrees of freedom, its density can be written as: 


YX2-D Exp (= Y/2) 


F(Y|N) = T(N/2) 2° 


(See, for example, Press in the bibliography.) 


160 


Elementary Linear Regression, Goodness of Fit, and Analysis of Variance (ANOVA) Problems 


TY Gamma Function The non-APL symbol I'(N/2) represents the gamma function, which 


1X 


generalizes the factorial. If N/2 is an integer, then [(N/2) = (N/2 — 1)!. 
T(p) is defined by the integral 


l(p) = ie xP le Tdx 


Ifp is an integer, I(p) = (p — 1)!. The chi-square distribution of Y with 
N/2 degrees of freedom is nothing more than the gamma distribution of the 
variable Y/2 with parameter p = N/2. In APL, I'(N/2) is easily calculated; 
in fact, [(N/2) is given by !(N + 2) — 1, even where N ~ 2 is a noninteger. 

In order to calculate the integral of F(Y|N) from zero to some bound B, 
we will have to use a numerical approximation. The simplest proce- 
dure to follow is that used in obtaining the Normal integral whereby we 
broke the interval over which the function is to be integrated into a series 
of small intervals and added up the approximate areas to get the integral. 
Before continuing you might wish to review briefly section 6.3 in Chapter 6. 

How many intervals are needed? Or rather, how small should each 
interval be? If we regard the interval from 0 to 10\/2N, where /2N is the 
standard deviation, as the effective range of the variable Y, i.e., integration 
of the density function over this range gives a value close to one, and if we 
regard a suitable interval length as a 32nd of a standard deviation, then in- 
tegration over the effective range would involve 320 intervals. For an inte- 
gration from 0 to B, we need to determine how many intervals are needed. 
Note that we do not want to divide the interval 0 toB by some fixed number 
since the accuracy of the integral would vary tremendously with variations 
in the value of B. Figure 9.1 will help clarify our stratagem. In the sample 
shown, the interval zero to B should be broken up into 9 sub-intervals. 


Figure 9.1 Illustration of a Method for Determining Integration Intervals 


Intervals for Range (0, 10 V2N) 


Oo B 10 V2N 
e e 
e e 
lelelelelelelelelel 
O B 


| Represents interval boundaries 
e Represents interval mid points 


Thus, the required number of sub-intervals can be obtained in APL as 
follows: 


Ie (€0,1320)x((2xN)*0.5)732 


NINT<«+ /B>L 


The first line produces 321 points on the interval from 0 to 10 V2N and the 
second line gives, by compression, the numbers of such points less than B 
and hence the number of intervals into which 0 to B is to be divided. 


Cumulative 
Chi-Square 


Comparison of 
APL-STAT Results 
and Tabulated 
Values 
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The idea underlying the approximation of the integral is uncompli- 
cated.* On each sub-interval, we will approximate the actual area under 
F(Y|N) by a rectangle whose base is given by BW (B — 0} + NINT and 
whose height is F(¥*|N), where Y* is the mid-point of the interval. This 
approximating procedure is known as the rectangular method. It is the 
same as that used in Chapter 6 to calculate the cumulative normal distribu- 
tion. A brief glance at figure 6.2b will remind you of the main idea. 

The sequence of mid-points at which F(Y|N) is to be calculated is given by 


YSC( (1NINT )xBW)-BW+2 


In Figure 9.1 above, these points are shown as the dots on the interval 
{0, B]. 

We now have most of the pieces, so that all we have to do is fit them 
together. Consider 


V WN CUMCHTSQ B;YS <N = degrees of freedom 
B = boundary of integration 
[1] DEN<(2*NN)x!(NN+NS2)-1 <Calculates denominator 
of F(Y|N) 
[2] MNINT«+/B > L+(0,1320) <—{ Determines number of 
x((2xN)*0.5)#32 intervals needed 


[3] YS«(( iNINT)xBW)-(BW+B+NINT)+2 <Midpoints of the intervals 
[4] FYS+( YS*(NN-1))x*(-YS%2) <-Evaluates the density at YS 


[5] We(W++/FYSxBW)+DEN </( W gives probability of 
F ‘ random variable distributed 
[6] THE CUM CHI SQ DISTN ak F(Y|N) of being less 
[7] ‘WITH '3;N3' DEG FREEDOM! than B. 
[8] 'FC(B) IS 'swW 
ESd MhertR) fe CTW) 


[10] Vv 


Let’s try our function and compare its results with those obtained from 
the Biometrika Tables prepared by Pearson and Hartley. The official tables 
give the following numbers. 


Biometrika Table Chi-Sq Values 


B 0.1 1.0 5.0 10.0 
D.F.\ 


l 0.24817 0.68269 0.97465 0.99943 
2 0.04877 0.39347 0.91791 0.99326 
5 0.00016 0.03743 0.58412 0.92476 
10 —_ 0.00017 0.10882 0.55951 
20 — — 0.00028 0.03183 


* Any advanced calculus text or book on numerical approximation will discuss various methods. One 
useful reference is W. Kaplan, Advanced Calculus, Addison-Wesley, 1952, Reading, Mass. 
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Approximate APL Chi-Sq Values 


B 0.1 1.0 5.0 10.0 
D.F.\ 


1 20434 63228 92400 94767 
2 04877 39345 .91788 99322 
5 .00016 .03741 58414 92478 
10 — 00017 10881 55951 
20 — _ .00028 .03182 


There are a few items worth mentioning with this routine. First of all, in 
the header to the function, the name was given as CUMCHIS@, instead of 
something like CUMCHI-S@Q. The reason for this is that a symbol such as 
‘*—°* cannot be used in the definition of a function name. Further, and very 
importantly, the way in which the function is written, the arguments N and 
B must be scalars, not arrays. If you define either N or B as an array and 
attempt to use CUMCHISQ, you will get the response 


LENGTH ERROR 
CUMCHISQL1] NINT++/B > L*(0,320)x(2xN)*0.5)#32 
A 


if N is an array, or the caret will be under B, if only B is an array. 
However, the most important lesson from the above tables is that while 
the routine provides a useful approximation to the correct values for de- 
grees of freedom greater than 1, the approximation is very bad for one 
degree of freedom, and we shall not bother with this now. However, you 
should note that the difficulty is not simply one in which the interval width 
is too broad. If you were to examine a plot of the densities for 1, 2, 3, and 4 
degrees of freedom, you would see considerable differences in the shape 
of the chi-square distribution as the degrees of freedom increase in this 
range as shown in the graph in problem 10. What is required is a much more 
accurate approximating procedure—our simple rectangular procedure is 
inadequate for the task; some suggestions are contained in the exercises. 
Let us consider the F distribution with k, and k, degrees of freedom. The 


density function can be written as: 


— ki/ key 20 (ka + ke)/2) 
fw) = TTDI aD 


The terms involved in the constant of integration present no difficulty after 
the chi-square distribution. Since our strategy for obtaining the cumula- 
tive distribution worked reasonably well with the chi-square distribution, 
let’s try it with the F distribution, whose standard deviation in algebraic 
terms Is: 


wi2D-1( | + (ky / Ka) w) (it kee 


ky (ke ~ 2)*(ke ~ 4) 
where k must be greater than 4 for the variance to be defined. 


The F cumulative distribution function routine can be set up in a manner 
very similar to that of the chi-square distribution. The main differences 
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occur in the fact that the F distribution has two degrees of freedom instead 
of one and ‘‘FYS”’ has a different definition in the two routines. Try the 


following: 

V WeP CUMF B 
[1] aTHIS ROUTINE CANNOT BE USED WITH 
[2] LESS THAN FIVE DENOM, DEG. OF FREEDOM. 
[3] CONST+(!((PS<+/P)#2)-1)*PL1]*P1+PL1]+2 
Cu] CONST<CONST:(!(PL2]+2)-1)x(!P1-1)*xPL2]*P1 
[5] STD<2x(PS-2)xP[2]*2 
[6] STD<(STD+(PL1Ix(PL2J-4)x(PE2)-2)*2))*0.5 
[7] WINT++/B > L<(0,1320)xSTD+32 
[8] YS<(( wWINT)xBW)-(BW<B+NINT) +2 
[9] FYS<(¥S*P1-1)x(1+(PL1]+PL2])xYS)*( -PS#2) 
[10] We(We+/FYSxBW)xCONST 
[11] ‘THE CUMULATIVE F DISTN WITH' 
[12] PC1];' AND ';P[2];' DEG. FREEDOM‘ 
[13] ‘IS t3W 
[14] Vv 


The first two lines of this routine form a comment which will warn the 
user when he displays the function that it cannot be used with less than five 
denominator degrees of freedom. The comment symbol is made by keying 
uppershift C (called cap), backspace, and uppershift -/ (or jot). Everything 
to the right of this symbol is regarded as a comment. When the function is 
executed, comments are ignored; they are only printed when the function is 
displayed. 

B is one limit of integration; 0 is assumed to be the other, in that we are 
calculating the probability of the random variable being contained in the 
interval [0, B]. The array P contains the degrees of freedom, k,, ko. 

In the calculation of the integral of f(w) there are two component parts 
multiplied together, a constant: 


(ki /k)"?T((ky + kp)/2) 
D(ky/k2)U(ke/2) 
and that portion of the function to be integrated is 
whi 2-1 | + (ky /ky)wy * +ke!2 
In calculating the integral the multiplicative constant can be calculated 


separately from the second part and the two multiplied together to get the 
answer. 


164 


Example of Use of 
Cumulative F 
Distribution Routine 


Elementary Linear Regression, Goodness of Fit, and Analysis of Variance (ANOVA) Problems 


Lines (3] and [4] determine the value of the constant term; line [3] 
produces the numerator and [4] the denominator. Remember that I'(4,/2) is 
obtained in APL by !(P[1]+2)-1 since algebraically, [(k,/2) = (k,/2 — 
1)!, when k,/2 is an integer. The APL function ‘‘factorial’’, !, is really the 
gamma function. 

Lines [5] and [6] calculate the standard deviation of the distribution 
which is needed for determining the interval widths. 

Lines [7] to [10] determine the probability of the random variable lying in 
the interval [0, B] in a manner very similar to the previous effort. 

As a check on the accuracy of the routine, examine the following table. 
The table shows the probabilities obtained from the routine using various 
combinations of degrees of freedom and bounds. At the head of each 
column the theoretically correct probability is listed. 


Table of Probabilities from F Distributions* 


Prob. 
D.F. 75 95 99 999 


(1,5) 0.66 0.86 .90 0.91 
(1.69) (6.61) (16.26) (47.18) 
(1,10) 0.69 0.89 0.93 94 
(1.49) (4.96) (10.04) (21.04) 
(2,5) 0.75 0.95 0.99 0.998 
(1.85) (5.79) (13.27) (37.12) 
(4,5) 0.75 0.95 0.99 1.000 
(1.89) (5.09) (11.39) (31.09) 
(4,10) 0.75 0.95 0.99 0.999 
(1.59) (3.48) (5.99) (11.28) 
(10,20) 0.75 0.95 0.99 0.999 
(1.40) (2.35) (3.37) (5.08) 


* The bound B is given in parentheses below the probability. 


Once again, as with the chi-square distribution, we see that one degree 
of freedom in the numerator causes difficulties with the approximation. As 
both degrees of freedom increase, the relative accuracy increases. This is 
because the functions being approximated are easier to handle with our 
simple rectangular procedure. Greater accuracy and less computation can 
be obtained with a more sophisticated procedure, such as the use of 
Simpson’s rule. 

An example of use of the routine is: 


P 


P CUMPF B 


Summary 
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THE CUMULATIVE F DISTN WITH 
1 AND 5 DEG. FREEDOM 
IS 0.662880113 


0.662880113 


REGRESS: Simple least squares regression—an APL routine 
was provided to calculate most of the relevant statistics in a simple linear 
regression model of the type Y= a + bX + U. 

GOODFIT: a routine to perform goodness of fit tests. 


CONTAB: aroutine for calculating the statistics required in tests 
of hypotheses within a contingency table. 

ANOVAI: a routine for carrying out simple one-way analysis of 
variance. 

ANOVA2: a routine for performing two-way analysis of vari- 
ance. 

Reshape, po (uppershift ®): a dyadic function which rearranges 
the ‘‘array shape’”’ of the right-hand argument as specified by the left-hand 
array. Example: 


C+2 3 0A 


rearranges the elements of the array A into a table (matrix) composed of 
two rows of three columns each; elements fromA are stored in the variable 
C row by row in sequence from the elements of A. 

Outer Product, o.x (jot dot) (uppershift 7), period, multiplica- 
tion): a dyadic function which multiplies each of the m elements in the 
left-hand argument array with each of the n elements in the right-hand 
argument array to form a table of dimension (m xX n) containing all m x n 
multiplications. 

Reduction, /: a monadic function. When used over two-dimen- 
sional arrays (tables or matrices) f/ TABLE produces an array formed by the 
f reduction of each row of TABLE, where f is one of the arithmetic 
functions. 

Reduction, / by columns for a two-dimensional array is obtained by 
using / (reduction, backspace, minus sign). 

Gamma function, ! (uppershift X, backspace, period): a monadic 
function to evaluate the gamma function, which generalizes the factorial 
function. 'P in APL produces the mathematical result of [(P + 1). 

CUMCHISQ: a routine to calculate the integral of the chi- 
squared density function. 

CUMF: a routine to calculate the integral of the F' density func- 
tion. 
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Exercises 


Elementary Linear Regression, Goodness of Fit, and Analysis of Variance (ANOVA) Problems 


Comment, 4 (uppershift C, backspace, uppershift 7) used as the 
first character of a line inside a function to provide explanatory comments 
when function is displayed. Comments are not printed on execution of 
function and are otherwise ignored. 


APL Practice 


1. 


For the two arrays %+120 and Y+ 5+120, use your knowledge of APL 
to perform the following calculations based on the mathematical for- 
mulas listed below. These formulas are useful in regression analysis. 


(a) B= PHes Bas XY; = y) 


estimator of the regres- 


(x; — x)? sion slope coefficient. 
x = Sx,/n and 
y = By;/n 
(b) A= y — Bx estimator of the con- 
stant term 
(c) uj = y; -A — Bx, t=1,2,...,2n vector of estimated 
errors 
(d) S34? = S(y, — A — By,)? sum of the squares of 
the error 
(e) & = Tw estimator of variance 
oP aD? of disturbance terms 
(f) 53 = ji estimator of the vari- 
o 3x, — xP ance of the slope 
1 x estimator of the vari- 
2 = 2 
(8) Sa = Su (; = Xx; - ¥) ance of the constant 
term 
= =x? 5 estimator of the vari- 
(hy COV (A, B) = Bre = al Su ance of the covariance 
of the estimators A 
and B 
; _— 2x, — XC — Y) 
Oi nS ,Sy 
where 
S, = X(x; — x)?/(n — 1), and simple correlation 
Sy aye yy) ne 1) coefficient of x and y 
G) SST = X(y, — y)? total sum of squares 
SSR = ((A + Bx;) — 5»)? regression sum of 
squares 
SSR F angers 
(k) RS = Gea; and compare your answer with that in (j), the coeffi- 


cient of determination. 
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the variance of the 
predicted value of y 


: as 
) Viyl%o) = Si [; > se =F 


given Xo 
(m) T = y — (A + Bxy) t-statistic of the pre- 
i l (x) — x)? dicted value 
(st [1+ 9+ SG =9]) 


2. This exercise introduces you to some novel ideas about constructing 
some matrices that you will find useful in the following chapters. Let 
T1110. 


(a) Io.=I produces the identity matrix 

(b) 0 3[1+Z°.<Z] produces an upper triangular matrix 

(c) 0 3L1+F°.=Z] produces a diagonal matrix 

(d) 2 3£1+(Te°.=7)#1]a symmetric matrix 

(e) 10 10 6111 a circular matrix 

(f) 10 10 p2 3,(8 00),3 a tridiagonal matrix 

(g) Vo.=Y where +i 142 2 2 2 3 33 3 3, block diagonal 
matrix 


3. The following exercises are basic to calculations in ANOVA problems. 
For any matrix X,,, where n is the number of rows and k is the 
number of columns, use your APL to write a routine to calculate each 

of the following: 


(a) Row means 

(b) Column means 

(c) Total mean and compare to (d) and (e) 
(d) Mean of row means 

(e) Mean of column means 


(f) The mean of the squared differences of each element from the total 
mean 

(g) The mean of the squared differences of each element from its own 
row mean 

(h) The mean of the squared differences of each element from its own 
column mean 


(i) The mean of the squared differences of each element from its own 
row maximum value 


4. You might want to see where the total variation of any contingency 
table might come from. Let the matrices A, B, C, D represent four 
different sets of tabulated data of ninety observations which you wish 
to analyze. 


10 10 10 10 10 10 
A=/10 10 10 B=] 4 10 16 


10 10 10 16 10 4 
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8 8 14 8 8 8 
C=] 4 14 12 D=| 4 14 12 
9 8 13 15 8 13 


Use your program in Exercise 3 to get all the quantities given by the 
program for all four matrices. Comment on the results. 


*5. Let F(P) = ie xP le—tdy 


Find F(20) and F(3.5) using the generalized APL factorial. 


Statistical Applications 


1. You are given the following data: 


Xx 65 63 67 64 64 68 62 70 66 68 67 69 71 
Y 68 66 68 65 69 66 68 65 71 67 68 70 65 


where X is the height of the father and Y is the height of the son, both 
measurements taken to the nearest inch. The objective is to find out if 
the height of the son depends on the father’s height, and if so what is 
the specific relationship between the fathers’ and sons’ heights. Use 
your Y REGRESS X function (page 146) to find out which one of the 
following regressions fits the data best. 


(a) Y=at+bxt+u 

(b) Y=atbet+u 

(c) Y=at+binx+u & e¥ = etre" 
(dj) InY=Inat+xt+u < Y = aete" 
(ec) nY=Ina+blnx+u © Y= axe" 
(ff) Y=a+t(b/x) +u 


Use as your criterion for best fit that regression which produces the 
highest sample value for the coefficient of determination. 

For each equation plot the residuals 7 = Y — Y against Y. Comment 
on your observations. 


2. Use the routine Y REGRESS X (page 146) to solve the following prob- 
lem. 
You are given the data: 
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where X = weight to the nearest pound, Y = height to the nearest 
inch, and Z = age to the nearest year of 12 boys. 

Run the following regressions in order to discover to what extent 
the variables X, Y, and Z are linearly related. 


a,: X=atbyt+u, 
ay: Y=at+bxt+ ty 
Ag : Y=a+bz+ us 
ay: X=art+bz+uy 
After making the required transformations, run the regressions 
as: log, Y= a + b logy X + us 
ag: Y=a+ b/x + us 
a;: Y=a-+blogX + u,; 
ag: Y =a + blog Z + ug 
Ay: Z = x®e"y, which can be written as Z = In a+ 
bin x + uy 


Explain intuitively the meaning of each equation and of the estimated 
ds and bs. 


3. The following data give the yields of wheat on some experimental 
plots of ground corresponding to four different sulphur treatments for 
the control of rust disease. The treatments consisted of: 

(1) dusting before rain 

(2) dusting after rain 

(3) dusting once each week 

(4) no dusting 

Test to see if there are any significant differences in the yields due to 
the dusting methods. 


Dusting method 


Plot 1 2 3 4 
1 5.3 4.4 8.4 7.4 
2 3.7 5.1 6.0 4.3 
3 14.3 5.4 4.9 3.5 
4 6.5 12.1 9.5 3.8 


4. The number of units of work done per day by five workers using four 
different types of machines is given in the following table. Each 
worker operated each type of machine for one day. Find estimates of 
the differential effects due to 
(a) machine type 
(b) worker’s skill 


In each case, specify carefully the maintained, null, and alternate 
hypotheses 
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Units of Work Output by Type of Machine and Worker 


Machine Type 


Worker I II Ill 
1 40 40 48 
2 40 42 50 
3 35 37 45 
4 42 36 48 
5 36 40 50 


(c) Now suppose that you were not able to get the observation (3, 
III), i.e., the number of units for the third worker using the third 


machine. How would you answer questions (a) and (b)? 
Some suggestions are: 


1. Put some row or column average in the (3, ITI) position. 


2. Change the routine in such a way that you need use only three 
observations for the third row and four observations for the 


third column. 


5. Put the cumulative F distribution function (page 163) into your work- 
space. Add some lines to ANOVAI and ANOVA2 functions already 
defined in order to give the answers of hypotheses tests immediately, 


so that you don’t have to look up the F tables. 


6. Add some lines to the function Y REGRESS X to obtain the following 


statistics. 

(a) the vector of calculated Y, call it ¥ 

(b) the vector of calculated residuals, call it 7 = Y — Y 
(c) the mean of X, Y, and Y 

(d) var (U) and var (Y) 


7. Poultry researchers investigated the weight gains (in pounds) of four 
types of ‘‘super’’ or ‘‘Industrial’’ turkeys fed three different rations 


over a period of several months. The results are listed below: 


Type of Ration 


Type of 

Turkeys 1 2 3 
A 50 45 35 
B 4] 38 45 
Cc 61 35 55 
D 55 59 61 


Find estimates of the differential effects of (a) rations, and (b) 


types, and test the hypotheses: 


(a) Hy: The variations of weight gains is due to the different rations, 


regardless of the type of turkey. 
against H,: The rations have no effect on gains. 
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(b) Hy: The variation of gains is due to the type of turkey, regardless 
of the rations. 
against H,: The type of turkey does not affect gains. 

(c) Hp: Neither the type nor the ration affects the weight against H,: 
At least one of type or ration affects the weight. 


8. The following diagram will help you understand the role of the degrees 
of freedom () of the X? distribution. Let X? represent the random 
variable and f(X”) the corresponding density function. 


Notice that for the same upper bound (8) and increasing degrees of 
freedom the value of the integral, . fi) dx*, decreases inn, 


(a) Use your V CUMCHIS@ Bfunction (page 161) to verify this for N = 
1,2,3,...,20andB = S$. 

(b) Use the WV CUMCHTSQ B function (page 161) to calculate the 
integrals for N = 20 and B = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10. 

(c) Alternatively. How would you use the ¥ CUMCHIS@ B function to 
determine the value of B for which the area from 0 to B is 87% 
when N = 20? Notice that the value of 87% is not in the tables of 
the chi-square distribution given by most textbooks. 
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Matrix Algebra in APL— 
How Simple It Is 


10.1 Vectors, Matrices, and Arrays 


Vectors, Matrices, 
and Arrays 


Arrays, Column 
Vector, Row Vector 
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Up to this point in our discussion we have dealt with variables which can 
be scalars or which can be arrays; arrays have one or more dimensions, 
scalars are dimensionless. You will recall that if we define the variable V by 
V «‘*some number,”’ >) produces ‘‘blank’’ because a scalar has no dimen- 
sion, and ee V (which is the dimension of the number of dimensions) 
produces 0. But if V is defined by Y «‘‘a list of numbers,’’pV produces the 
number of elements in the list and poV produces 1 (the number of ele- 
ments in the list of dimensions). 

Chapter 9 introduced a very important extension to our definition of 
variables—variables defined as two-dimensional arrays which can be visu- 
alized as a table of entries with rows and columns. In this chapter we will 
develop our APL tools for handling two-dimensional arrays and we will 
distinguish between APL expressions such as ‘‘arrays’’ and mathematical 
expressions such as ‘“‘vector’’ and ‘‘matrix.’’ In Chapter 11 we will extend 
the examination of arrays in APL to three and higher dimensions! 

Following the theme of this book, we will relate the APL expressions 
directly to the mathematical operations you have learned, or are learning, 
in matrix algebra. Even if you do not know any matrix algebra at all, this 
book will help you to learn some. 

A matrix is a two-dimensional array. Its dimensions, say (n, m) (some- 
times this is written as (n X m), signify that the matrix has n rows and m 
columns. But what if either m orn is 1? If the matrix is 1 x m (one row, m 
columns), it is called a row vector (or often just a vector for short). And if 
the matrix is n X 1( rows, 1 column), it is called a column vector (just a 
vector for short). So you see that when referring to a vector, you have to 
be careful to distinguish between row vectors and column vectors. 

Now a row vector is not a column Vector is not a list! (Remember, a list 
has only one dimension: length.) ‘‘Array’’ in APL is a general term indicat- 


Reshape o 


Ravel , 
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ing a variable with one or more dimensions. An array with one dimension is 
a list; an array with two dimensions is a table. Mathematically, we need to 
be a bit more precise, so we will often have to be careful to distinguish row 
and column vectors from each other, from matrices, and especially from 
lists. 

If you have alist, say A, and you want to make it a vector, either column 
or row, then use dyadic 9. For example, 


A«+ 1234 
A is a list. 
CV<4 1 9A 


CV is a column vector, or a matrix of dimensions (4, 1). 


RV<+1 4 pA 
RV is a row vector, or a matrix of dimensions (1, 4). 
Now type 
A 
1234 
CV 
1 
2 
3 
u 
RV 
123 4 


A, CV, and RY are all arrays. 


pA 
{4 elements in a list. 
u 
pCV 
a {4 rows, 1 column in a column vector. 
oRV 
ai {1 row, 4 columns in a row vector. 
Now try: 
Be ,CV Ravel ‘‘,”’ converts the column vec- 
SB tor CV into a list. 
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De RV Again, the monadic function ravel 
oD converts the row vector RV into a 
list. 


uy 


So now you know that lists and vectors are different, how to get vectors 
from lists (use reshape p), and how to get lists from vectors (use ravel, ). 


10.2 Elementary Matrix Operations 


Our first task is to define some matrices to use as examples. Try 
A232 39123456 
A13«1 391 2 3 
A31+3 103 2 1 
A333 3901 233214 2 3 
B23+2 39123 ° 456 
B31<3 1p4 1 2 
B13<«1 3p4 1 2 


Naming of matrices is arbitrary. You could have used DOG or MOUSE, etc. 
Our name choice was intended to make the dimensions easier to under- 


stand. 
Now that we have defined a number of two-dimensional arrays, we will 


need to consider how to refer to individual elements of an array. Consider 


A33[132] 
2 

A23[2;3] 
6 

Bsal2st) 
1 


That’s easy enough, but what if we want to refer to a whole row or column 
of A33? How could we do that? Try 


A33[13 J 


A23L 32] 


2-9 


Notice that with 433[1; Jand, more strikingly, with 423[ ;2] the array 
returned is alist, not a row or column vector. Thus 


Matrix Addition and 
Subtraction 


10.2 Elementary Matrix Operations 
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pA33[1; J] 
3 
pA23L 32] 
2 
Now try 
ASSC1 23 1 2] oe is the first two rows and 
12 the first two columns of A 33. 
S32 
or 
(25) 
A33[1 3; 2 1] Vian Eivstand 
? (43) 8 2 4 third rows 
- >O 0 : a 
hoO* columns. 
1 2 
or again 
A33[1 2; 3] {Gives the 3rd element in rows 1 and 2. 
cine 


Matrix Addition and Subtraction 
Matrices of the same dimensions can be added and subtracted. 

A23+B23 

O4 6 

0 10 12 

but if you try 
A23+A31 

LENGTH ERROR 
A23+A31 
A 


This is because in APL, addition, subtraction, multiplication, and so on, are 
operations which are carried out between corresponding elements in the 
two matrices. This is nothing more than an extension to two dimensions of 
the idea we met before in adding, subtracting, etc., one-dimensional arrays 
or lists. 
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Multiply Matrix by 
a Scalar 
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Multiplication +.x 
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Multiplying a Matrix by a Scalar 


Multiplying a matrix by a scalar is obviously easy: 


3xA23 
3.6 9 
12 15 18 


and forming linear combinations of vectors is also easy. Try the mathemat- 
ical relation 2(A31) + 3(B31); in APL this is 


(2xA31)+3xB31 
18 
7 
8 


What would happen if we removed the parentheses? Try it. 


Matrix Multiplication 


The mathematical operation of matrix multiplication (to be distinguished 
from the APL operation of multiplication between arrays, e.g., A23 < B23) 
is nothing more than the APL function called inner product (defined on 
page 27) between the rows of the first matrix and the columns of the 
second. The mathematical expression for the matrix multiplication of two 
matrices B23, A33 to give a matrix C is C = (B23) (A33), where C has 
dimensions (2 X 3). 


C#B23+.xA33 


C 
Ti Be 38 
35 14 11 


For example, the [1;2] element of C is the inner product of row [1;] of B23 
and column [;2] of A33. Let’s check that. 


B230131+.xA33L 32] 
8 
or, to obtain C [2;2], try 
B23(2;1+.x433032] 
14 


The APL operation of +.x for obtaining the mathematical operation of 
inner product between vectors or, more generally, ‘“‘matrix multiply,”’ 
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while very useful, is not without some dangers. What if by mistake one 
of the matrix variables is not defined to be a matrix, but is only a list? 
Consider 


BT+4 1 2 {BT is a list of dimension 3. 
GT<A33+.xBT 


GT 
12 16 24 The result of the operation +. 
[between a matrix (3 x 3) anda 
par list is a list of dimension 3. 


3 


The importance of noting the difference between 433+. XBT and A33+.xB31 
is illustrated by the following. Try 


AT<4 5 6 {AT is a list of dimension 3. 
A134CP { ‘in is 1 x 3, but GT has only 
dimension 3. 
RANK ERROR 
A13+G? 
A 
because 
pA13 
1 3 
oAT 
3 
But 
AT+GT {Both are lists. 
16 21 30 
and 


A31+(433+.xB31) 


15 

18 - A column vector (matrix which 
is 3 X 1). 

15 

because 


0433+.xB31 
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0A34 
3 4 


So the rule is: to be safe in carrying out matrix operations, do not mix lists 
with vectors and make sure all your arrays are dimensioned as matrices 
and vectors of the appropriate shape. That is, you should always know 
whether you are dealing with a row or a column vector. This last point is a 
tricky one. Consider 


G+A33+.xB31 
G 

12 

16 

24 


But what if we take the inner product between A33 and B13, which opera- 
tion is not defined mathematically as a‘‘matrix multiplication’’? 


A33+.xB13 
24 6 12 
24 6 12 
36 9 18 


It is unfortunate here that you do not get an error message telling you that 
you cannot multiply a (3 x 3) matrix by a (1 < 3) vector. What happens in 
APL is that each element of B 13 is treated in turn as a scalar, and eachrow 
of A33 is multiplied by that scalar and added up to get the above result. 
Thus 
24=(1+124+3)x4 6=(14+24+3)x1 12=(1+2+3)x2 
24=(34+2+1)x4 6=(34+2+1)x1 12=86+2+1)x2 
36=(44+2+3)x4 9=(44+24+3)X1 18=(444+24+3)x2 


Compare this result with 433+ . =< B31. 


10.3 Transpose of a Matrix 


Matrix Transpose ® Matrices can be transposed. That is, a (m x n) matrix is converted to an 
(” X m) matrix simply by writing all its rows as the columns of the trans- 
posed matrix (and hence all its columns as rows in the transposed matrix). 
The primitive APL function (i.e., it’s on the keyboard!) which does this is 
Transpose, & (key upper shift 0, backspace, and key \ ). Thus 


QB23 
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Transposition of a list does nothing, since a list has length only. Try trans- 
posing AT. 


10.4 A Not So Elementary Operation: Matrix Inverse 


Matrix Inverse 


Identity Matrix 


Singular and 
Nonsingular 
Matrices 


Let’s begin with the simplest situation—a square matrix which is nonsingu- 
lar. A square matrix is one with as many rows as columns. A nonsingular 
matrix is one for which no linear combination of the rows or columns will 
produce a vector of zeros, More formally, a matrix A with rows a,, do,..., 


a, orcolumnsa',a’,. . .,a" where we can find xo list ofm numbers (not zero 
of course) to satisfy either 

a,b, + dgbp + +++ + a,b, = 0 
or 

a'b, + ab, + +++ +a"b, = 0 


is nonsingular. An important and very special example of a nonsingular 
matrix is the identity matrix, J. J looks like this: 


10000... 0 
01000... 0 
00100 0 

0 
0 0 0 1 


That is, J is square and has zero’s everywhere except for the 1’s on the 
diagonal. 

The interesting thing about a nonsingular matrix is that we can always 
find another matrix, say B, which satisfies the following mathematical 
relationship: 


AB = BA=I1 


where AB and BA represent matrix products. B is said to be the ‘‘inverse’’ 
of A. The above relationship between A, B, and / is similar to n(n) = 1, 
where n 1s any nonzero number. So instead of talking about B, let’s com- 
plete the analogy with numbers and write A~' for ‘‘A inverse.’’ We have 
now 


AA = AtA =I 


and if A is a (1 X 1) dimensional matrix, we get the same result that we 
would get if A were a scalar, namely 


aa'=a'a=I 
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You will recall that, for a a number, a~ is called the reciprocal, or multi- 
plicative inverse. 

With numbers we know that +¥ gives DOMAIN ERROR if Whas the value o. 
The matrix analogue to N = Ois a singular matrix; singular matrices do not 
have inverses. Consider a few simple matrices: 


1 0 is nonsingular 
1 

1 3 is singular 
6 

1 2 3 is singular 

4 5 6 

7 8 9 

12 3 is nonsingular 

4 6 5 

8 7 9 


Try adding together combinations of rows (or of columns) in order to get a 
zero vector. More easily, see that you cannot find a combination of two of 
the rows (or columns) which equals the negative of the third for the non- 
singular matrix. 

This is all very well, but how can we get the inverse of a matrix in APL? 
Fortunately, we have another primitive function called ‘‘quad-divide,”’ 
‘‘domino,”’ or ‘‘matrix-divide.’’ It is the monadic , keyed in by upper shift 
Lbackspace, key +. Consider the following matrix. 


A+2 205 00 2 


A 
5 0 
0 2 
0.2 QO 
0 0.5 
A+. x4 
1 0 
0 1 
(GA )+. x4 
0 


Left and Right 
Inverse of a 
Nonsquare Matrix 
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Let’s calculate the inverse of A33 and check that it is in fact the inverse. 


F433 
“3.33338 1 “1, 30648 16 3.3333E 1 
4.1667F 1 7.50008 1 “6.6667F 1 
1.6667F 1 “5.0000F 1 1.0000E0 
A33+. E433 
1.0000F0 3.1345F 16 “?.6368F 16 
5.5511 17 1.000080 9.7145H 17 
u,4u09H 16 ~7.8469H 17 1.0000#0 


As long as we recognize that numbers on the order of 10~'* are essentially 
zero, we see that we did, in fact, get the inverse of A. Try (# 433) 
+.x433 on your own. From these examples you see that the results of the 
operation of taking the inverse may be only approximate and not exact. In 
any case, we see that fH 433 is a very good approximation to A33~', the 
inverse of A33. 

Domino, or quad-divide, just like + for numbers, also has a dyadic use 
that we will discuss further in a moment. For now, note that all you needed 
to do to check that (& 433)+.x433 is approximately J; was to type 


A33H1A33 
1.0000F0 8.7093E 17 2.7911F 16 
2.22148 16 1.0000F0 “2.4126F 16 
“1.3362F16 1.7367F 16 1.0000E0 


If you are wondering why you get slightly different results in the two cases, 
the answer is that this is due to the use of different algorithms (computa- 
tional procedures) for using — dyadically and using +. with monadic —. 

This is simple enough if you have been reading some matrix algebra, but 
&]in APL enables you to consider something a little more mathematically 
tricky. We just reminded you that for a square matrix A, A~! is a matrix 
which satisfies two mathematical conditions: 


(a) AJTA=TI 
(b) AAT=T7 


where / is the identity matrix of the same dimensions (square) as A and, of 
course, A~'. But what if we have a nonsquare matrix, say A, whichis (m x 
n), and there exists a matrix such that only one of the conditions holds? We 
can define a left and a right inverse, say A,, and A p,, depending upon which 
condition is satisfied. It is only when A is square and nonsingular that both 
left and right inverses exist and are equal (i.e., A,;; = Aj) so that we can 
talk about the inverse of A. In this case A ~! = A,, = Ap;. Thus if we can find 
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a matrix Ap, so that 


A +.X Ap = I 
(m x n) (nx mm) (mx m) 


then A py, is the right inverse of A. Similarly, if 


Au +.X A = I 
(n X m) (mx n) (nx n) 

then A,; is said to be a left inverse (it multiplies A from the left). 

Quad-divide applied to a nonsquare matrix A will yield the /eft inverse of 
A if that inverse exists. A left inverse of A can be found if the columns of A 
are linearly independent vectors, which means that no column can be set 
equal to a linear combination of the other columns. If the columns of A are 
linearly independent, A is said to have full column rank. Let’s try it. We 
need a nonsquare matrix with at least as many rows as columns. Let’s use 
the transpose of A23, which gives us 3 rows and only 2 columns. Key in 


9423 
14 
25 
3 6 


8423 is of full column rank because one cannot find a number C such that 


1 4 
2/C=15 
3 6 
Now try 
EIQA 23 
“O. 94444 “O.11111 0.72222 
O.4444u4 0.11111 “0.22222 


If this is to be a left inverse, premultiplying 9423 by 8423 should give /,: 
(HQA 23) +. xQ423 


1.0000F0 1.3323F 15 
6.9389F 17 1.000020 


Here we see that &&A23 would give us the right inverse of A23. From 
this you will realize that the right inverse of A23 is the transpose of the left 
inverse of the transpose of A23. Try 


A2 385A 23 


1.0000F0 6.9389F 17 


1.3323F 15 1.00000 


System of Linear 
Equations 


Linear Regression 
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You will notice that the matrix result printed out is the transpose of the 
previous matrix result. 


Systems of Linear Equations 


Now that we are equipped with quad-divide, we have an easy way to 
solve linear equations and to obtain linear least-squares estimates of coeffi- 
cients in a much simpler way than the approach of Chapter 9. Consider first 
the linear equation 


Y = XB 


Where Y is an (n x 1) vector, X is an (x X m) matrix, and B is an (m xX 1) 
vector. Clearly Y is obtained as the inner product (mathematical operation 
of matrix multiply) of X and B. Suppose we know X and Y but not B and, 
perverse creatures that we are, we want to know B as well. We now see 
that the solution is apparent. If X has more rows than columns (n > m) and 
X has full column rank (rank of X = m), then all we need is the left inverse 
of X, say X,,. Thus if Y = XB, then X,, Y = X,,XB = IB = B. The way to 
write X,,¥ in APL is YX using the dyadic form of quad-divide or, more 
obviously but less elegantly, (fx) +.x Y. 

Let’s create a Y vector. Let X be the transpose of A 23 and let B be given 
by B<2 3. Then Y is defined by Y = X’B or, in APL, 


B<2 3 
Y<( X<QA23)+.xB 
Y 


14 19 24 


Remember, Y is a list, not a vector, because B was defined only as a list. If 
we solve for our known list B, we have 


Yelx 
2 3 


Will this approach work in the least-squares analysis of linear regres- 
sion? Consider the regression model we solved in Chapter 9: 


Y=at+bX+U 


where Y and X are observed lists, U is an unobserved list, anda and b are 
the coefficients to be estimated. If we have n observations on Y and. X, the 
model can be written in matrix form as follows: 


Y=ZB+U 


where Y is an (n X 1) vector, Z is the (m X 2) matrix shown below, B is a (2 
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x 1) vector with elements (a, b), and U is an (n X 1) vector of unobserved 
error terms. Z is given by 


1 x, 

1 XxX 
Z= 

1 x, 


If we use our current approach, how might we solve the problem? Well, 
Z has more rows than columns, so what do we get if we use the left 
inverse? When we multiply 


Y=Z7B+U 

by Z,, we get 

and if we feel justified in ‘‘ignoring’’ the term Z,,U, we have solved 
our problem: our estimate of B is Z,,Y. But what of the ‘‘least-squares”’ 
solution, and how does it compare to our dyadic quad-divide? The 
least-squares approach to the problem is to define the estimator B of 
B by 

B=('Z)'Z'Y 


where the symbol Z’ means the transpose of Z, Z'Z is a square matrix, and 
(Z'Z)~' is its inverse (both left and right). Before worrying about the statis- 
tical meaning of this, consider the term (Z' Z)"'Z’ ZB. (Z' Z)~' is the inverse 
of Z'Z, so (Z'Z)'Z' ZB = B, but this in turn means that (Z’Z)~'Z’ is aleft 
inverse of Z! We have, therefore, reached the inescapable conclusion 
that YEZ, which produces the inner product of the left inverse of Z with 
the vector Y, gives us the least-squares solution. To further establish this 
result, let’s redo our calculations from Chapter 9. First we set up the 
data: 


Y«55 70 90 100 90 105 80 110 125 115 130 130 
X<«100 90 80 70 70 70 70 65 60 60 55 50 

Z2<12 201 

ZL 321«X 


and now all we have to do is to type 


B<YHZ 
B 
210.44 ~1.5778 


which is exactly the same solution we got in Chapter 9. 
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Scalars: single numbers, with no dimension. 

Lists: variables that have one dimension—length. 

Arrays: variables that have at least one dimension. 

Matrix, ortable: an array with two dimensions (rows andcolumns). 

Vector: a special case of a matrix—a column vector is a matrix 
with one column, a row vector is a matrix with one row. 

Reshape, dyadic use of p: reshapes an array according to the 
specification of the left argument; e.g., Jo A where A is an array and/ isa 
list of integers reshapes A according to the dimensions specified in J. 

Ravel, monadic use of ‘‘.’’: converts the argument (array, list, or 
scalar) into a list. 

Indexing of arrays: ALA:8;C;] indicates that the ath plane of the 
bth row of the cth column is being referenced; or ALA; ;C;] refers to the 
ath plane and cth column for all rows. 

Arrays and arithmetic functions: A f 8 produces an array C, each 
of whose elements are defined by the functional relationship f between the 
corresponding elements of A and B. A and B must have the same dimen- 
sions, 

Matrix multiplication: obtained in APL by use of the inner prod- 
uct function +.x; ¢.g.,4 +.x B, where A and B are matrices, produces an 
arracy C whose elements are given by the mathematical operation of ma- 
trix multiplication. 

Transpose, &: keyed by upper shift 0, backspace, \. In its 
monadic use it alters an array so that its dimensions are reversed. If p 4 is 
3.5 2, p&Ais 2 5 3. 

Matrix transposition, A'(or 47): given by 94. 

Matrix inverse and identity matrix: The identity matrix, J, is a 
square matrix with 1’s on the diagonal and zero’s elsewhere. A square 
matrix A has an inverse A! if AA~! = A7!A = J, where A“'A represents 
matrix multiplication. A matrix is said to be nonsingular if its inverse 
exists. 

Quad-divide (domino), f}: keyed by upper shift Z, backspace, + 
(monadic use). If A is an array of two dimensions with the first at least as 
large as the second, then & A produces the left inverse of A, Aj); e.g., 
A, +.x4=I, where the dimensions of J are the same as the second dimen- 
sion of A. 

Quad-divide (dyadic use): Solves linear equations. For example, 
if you wish to find the array X such that AX = B, where B is alist andA isa 
matrix, then_X is given by BEA. The result BEV is equivalent to (E4)+. xB. 
In a linear regression Y = XB + U, the estimator for B can be obtained by 
YX, which produces a result equivalent to the mathematical expression 
(XX) YY. 
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Exercises 


APL Practice 


1, 


Let A<4 4 p15 

(a) Write the APL statements to select the first column of A? 

(b) Write the APL statements to select the second row of A? 

(c) Write the APL statements to replace all the elements that are 
equal to 5 with the number 6? 

(d) Write the APL statements to instruct the computer to multiply 
the first row by the last row element by element? 

(e) Write the APL statements to express each element of the matrix 
as a percentage of the largest element of the matrix? 

(f) Suppose you want each of the 4 column averages and each of the 
4 row averages. Write a routine to do this. 

(g) Write the APL statements to select the two by two middle block 
of the matrix A? 


Let A«2 2p 26 16 9 6, B+2 2 03 512, C26 9 , and D<1 ? 
016 6. Examine carefully the results of: 


(a) HB (i) CHB 

(b) B E (Gj) ( EB A)+.xA 
(c) H QB (k) AHAB 
(d) & ] Band compare to (c) (l) At.x BA 
(e) DAC (m) CHHAB 
(f) AC (n) ABA 
(g) ABB (0) 2 & 2 

(h) (,D) HB 


Solve the following systems of equations. 
(a) 3X, + 5X, = 26 
X,+2X%,= 9 
Compare this solution with the one for 2(i) above. 
(b) 3X, + 5X, = 16 
X,+2X,= 6 
Compare this solution with the one for 2(h) above. 
(c) Compare the results of both (a) and (b) with those found in 2(g) 
above. 
Find the left inverse of the matrix 2«3 2 63 5 1 2 4 5 and verify 
that ((HZ)+.*Z) = (QZ)+. «QZ . 
Consider the matrix W+3 21 2 2 4 3 6,which does not have full 
column rank. Use APL to verify that W does not have a left inverse. 


Consider the following system of equations: 
6X, + 4X, + 3X; = b, 
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20X, + 15X, + 12X3 = b, 

15X, + 12X, + 10X, = b, 

(a) Find the values of X,, X,, and X,; if b, = 13.1, b. = 46.9, and 
b, = 37.1. 

(b) Suppose you round the values of b,, b,, and b, to the nearest 
integer, i.e., b, = 13, by = 47, and bz = 37. Solve the system 
using the rounded values for the vector of b’s. Compare the solu- 
tion values obtained to your answer to (a). This exercise is an 
example of ill-conditioning; i.e., small changes in the values of 
the numbers in the problem lead to large changes in the solution 
values. 

7. Consider the following system of equations. 
5X, + 3X, = 
2X, + 3X, = 5 
3X, + 4X, = 7 
Let: 


joel 


(a) Compare the solutions to the following operations. 
1. (EB)+.xc 
2. C HB. 
3. (QC) +. xQHB 
Comment on the results. 
(b) Try the following operations: 
1. & 8 The left inverse of B. 
2. —} & B The inverse of the transpose of 2 does not exist. 
3. @E&B The same as in 2. 
4. & & B The transpose of the inverse. 
Thus for any matrix A ,,, with n < k the left inverse does not exist 
and the right inverse is 8H 4 while if n > k the right inverse does 
not exist and the left inverse is —] 4. 


| 
“I 


8. Using the logical function equal (=) and the matrices given in exer- 
cise 2, verify the following properties. 


Mathematically APL Explanation 


(a) B=(B')! B=88 B The transpose of the 
transpose is equal to 
the original matrix. 

(b) (AB)'=BtAt ( @4+.xB)=(9B8)+. x04 The transpose of a 
product is equal to 
the product of the 
transposes in reverse 
order. 


(c) (A+B)=B+A (A+B)=B+A Associative Law. 
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Mathematically APL Explanation 
(d) 3(AtB)=3A43B (3x (A+B) )=(3xA)+3xB Distributive Law. 
(e) (B! y tete ( —& B)-a B The inverse of the 


transpose is equal to 
the transpose of the 


inverse. 

(f) (a ) “ll B -HE B The inverse of the 
inverse is equal to 
the original matrix. 

(g) ips tog ta (H4+.xB)=(H B)+.xHA The inverse of a 


10. 


product is equal to 
the product of the 
inverses in reverse 
order. 


It is often the case that you need to check if the lim A”, where A is 


any square matrix, exists. In APL everything is easy. Consider 
the very simple function. 


VCONV A 
[1] A+A+.xB<A 
[2] 2341xx104+/+/A=B)#4x/pA V 


(a) Use the program to verify that the limits of the matrices XX and 
ZZ, when raised to the n power, are the Zero Matrix. 


0 1 0 0.1 0.2 0.3 
XxX ={|0 0 1 ZZ =/0.3 0.1 0.4 
0 0 0 0.3 0.2 0.5 


(Hint: Use the trace operator to see if the matrix in fact 
converges.) 


(b) How would you change the routine so that it would stop and 
print a message in case the matrix does not converge? 

(c) Can we use the program to check if a matrix is idempotent? That 
is, if A7 = A’A =A thenA is idempotent. 

(d) Change the program so that it will calculate the inverse of J — A 


using the formula (1 — A)"! =7+ A + A? + A? + A*4+---: in 
case A’s limit is the zero matrix. 
Let 
1 1 
X={1 2 
1 3 


Find A = I — X(X'X)"'X’, where /is the identity matrix, and verify 
that A is an idempotent matrix (i.e., A? = A). 
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ll. Using 
1 1 
X=!1 2 
1 3 
and 
1 2 
R=|2 4 
3 6 
find: 
(a) (Xx)! 
(b) (R'R)"! 


(c) [R(X’X) | RT" 

12. In the following input-output model let a;; be the input of product i 
per unit-volume of output of product j 0 < a, < 1, X; be the total 
output of product /, and let C; be the final demand for product /. 
Suppose we have only two products X, and X,. The input-output 
equations are 


1X4 + Aga Xo + Cy = Xe 


Given the matrix of input-output coefficients 
A= & 05 | 
0.4 0.2 
find the total production of X, and X, that will meet a final demand 
of C, = 20 and C, = 30. 


Statistical Applications 


1. Given the one equation model where the coefficient of X is known 
Y,=3X,+U, and X= [3 


Consider the efficiency of the following two estimators of the 
coefficient of X. 
S?Y, DiY,X, 
a, = SPY, and az = SEXP 


Prove that: 
a. Var (a,) < Var (az), i.e., a2 is more efficient than a, if the U,’s are 


independently distributed. 


1 with probability .5 


and Uy, = ni with probability .5  ¢ = 1,2 
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b. Var (a2) < Var (a;,), i.e., a, is more efficient than a, if the U,’s are 
not independently distributed, but have the following joint discrete 
probability distribution. 


U, U;, Probability 
(1,1) 1/10 
d,-1) 4/10 
(-1,1) 4/10 


(—1,-1) 1/10 


IMPORTANT: In appendix E you will find two sets of data named 

MACRO and WATT, as well as a detailed explanation of the symbols that 

will be used. You are advised to store these data sets into your file because 

they will be used for most of the exercises in this and the following chap- 
ters. 

2. Let C= a+ bY be the familiar Keynesian consumption function 
where C is the total consumption expenditures and Y is the GNP, both 
given in appendix E. Use the data from 1950 to 1978 to estimate the 
marginal propensity to consume, 5, and the average propensity to 
consume. Comment on the relationship between them. 


3. Another relationship that you probably learned in your macro- 
economics courses is that imports (IM) are positively related to 
GNP. Use the data from the data set MACRO (appendix E) to see 
which of the following equations better describes the relation between 
imports and GNP. 

(a) IM = a+ 6b GNP 
(b) IM = a + b GNP? 
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So far we have avoided large arrays of data and complicated statistical 
problems in an effort to learn the basic and easy procedures first. How- 
ever, we are now getting to the stage where we can branch out and be more 
adventurous. You will soon see that as the complexities of the statistical 
problems increase, and as the amount and variety of data increase, we will 
need to develop new mathematical tools and hence new APL procedures 
to handle them. 

The mathematical tool which is most heavily used in statistics is matrix 
algebra; it was introduced in the last chapter. What we need to do now is to 
learn to exploit the power of APL in solving a variety of data-handling 
problems and complex statistical procedures. It is to these issues that we 
now address ourselves. 


11.1 Reduction Function 


Reduction / 
Arrays and Indexing 


{In Chapter 9, when we were calculating the cell frequencies for a con- 
tingency table, we saw the need for the use of reduction across both rows 
and columns of a matrix. In fact, we can make the reduction function even 
more useful. For example, we might have a list of tables and want to get 
the average value of the (i, /) entries across all the tables. In APL we have a 
direct way of doing this, as we shall now see. 

Recall from Chapter 9 that the reduction function, /, operates on only 
one dimension of a multidimensional array at a time. For example, the use 
of +/on a matrix X of dimension 2 x 3 is 
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+/X 
6 15 


The summation has proceeded across the /ast coordinate of the matrix. That 
is, each row is reduced by its columns. Another way of indicating the 
coordinate over which reduction is taken is to specify the coordinate 
explicitly. For example 


(+/[2]X) 
6 15 
and 

(+/£11X) 
5 7 3 


In the latter case, the reduction is over the first coordinate; i.e., each 
column is reduced along its rows. In this case there is an alternative proce- 
dure. By overstriking the reduction operator (/) with the subtraction 
symbol, producing 7, we can also obtain reduction over the first coordi- 
nate. 

But when we have three dimensions, as in 


X<2 22 ox 
x 

12 

3.4 

5 6 

12 


where X is composed of two planes of two rows by two columns each, the 
situation is more complicated. In our example with X we have 


Columns 
1 


Plane 1 { ; 


Plane 2 {i 


<— Rows 1, 2 


<— Rows 1, 2 


NA BNI N 


The number ‘‘6’’ is in the second column of the first row of the second 
plane. 
To find sums across the third coordinate, we could use 


(+/X) 


11 3 


Expand \ with 
Arrays 
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(+/(3]X) 
3. ¢«7 


11 3 


which results in four sums—addition over each row in each plane. 3 is the 
reduction of the first row in the first plane, 7 is the reduction of the second 
row in the first plane, and so on. 

However, the main advantage of the use of the index notation with 
reduction is not merely the provision of an alternative to +/ (reduction over 
the /ast coordinate) or to +/ (reduction over the first coordinate), but that it 
enables us to get reduction over the middle coordinate. Suppose that we 
want column sums in each plane. We obtain this by 


+/L 21K 
4u 6 
6 8 


Reduction over the first coordinate is obtained by 


+/[1 1x 
6 8 
4u 6 
or by 
+7X 
6 8 
4 6 


which gives us the sums across matrices of the (i, j)th elements in each 
matrix. 

Any dyadic element can be used with the reduction operation. However, 
with — and + you have to consider carefully what the results will be. For 
example, (-/1 2 3 4) willproduce1 -2 -3 -4, andin APL the value of 
that expression is 2. Also (+/2 4 6 8) becomes 2 ? 4 + 6 + 8, which 
is equal to (2 x 6) + (4 x 8) in APL. So you need to be concerned 
about both the coordinate over which the reduction takes place and the 
meaning of the reduction itself. 

Recall the scan instruction, \, which is similar to the reduction instruc- 
tion. Scan has the same general form as reduction, and the same rules 
about index coordinates apply. In ‘‘sum-scan’’ we would have 


+\1 234 
which would result in 
13 6 10 


The general form of scan is FN\VAR, where FN is a primitive dyadic 
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function and VAR is a vector or matrix. The dyadic function FN is placed 
between successive additional elements of VAR. For example, 


+\1 2 3 4 
would be 
1 (142) (414243) (1+2+3+4) 


The last element is exactly the same as that which you would obtain with 
the ‘‘sum-reduction,’’ +/1 2 3 4. 

Let’s consider a practical example of the use of scan with three- 
dimensional arrays. Suppose that you had a three-dimensional array of oil 
production data. The planes represent different nations, the rows are oil 
pumping locations, and the columns are quarters of the year. We wish to 
know the cumulative totals by quarter for each site and each nation. For 
expository purposes, let us reduce the number and size of the matrices and 
use hypothetical values. We can construct the data by use of the roll 
function, ?, which generates random numbers (see Chapter 4). 


PRODUCTION+3 5 4 60760 


We will pretend that the above represents the quarterly output of three 
nations at five sites in each nation for four quarters. In order to generate 
the cumulative sums by quarter, we use +/ on the array PRODUCTION, since 
quarters are the elements in the last dimension: 


In schematic form these data are: 


PRODUCTION Quarters of Output 

y y y 1 

53 15 32 27 . . . . 
44 60 18 17 . 
458 25 9 N,- 


1355 3 6 . : . - Sy 
54 54 12 20 Three : . . - §$,— Output 
10 38 8 45 Nations WN, - , : * S3— at each 
43 30 36 35 . : . - Sy site 
37 1 14 16 - Ss 


48 39 2 50 Nz ° 


33 52 29 49 QO; Q; Q; 0, 
t t t t 
Quarters of Output 
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+\PRODUCTION 
53 68 100 127 A matrix for each nation. 
uy 104 122 139 Each row represents a site. 
4 62 87 96 Each column contains the accumulating 
47 92 144 173 partial sums of quarterly output. 


56 143 153 177 


13 68 71 77 
54 104 117 137 
10 48 56 102 
43 73 109 144 
37. 38 52 68 


23 64 69 88 
4B 87 89 139 

7 38 59 93 
42 68 96 107 
33 85 114 163 


Let’s look at the first nation and first site. We have 53 units of output for 
the first quarter, (53 + 15 = 68) units in the second, (68 + 32 = 100) in the 
third, and (100 + 27 = 127) in the fourth. Each of these rows yields our 
quarter-to-date totals, and each matrix represents the cumulative quantity 
output by sites for each country. 

Now, if we want the total quarter-to-date figures for each country (i.e., if 
we want to add up over sites for each country), we type in 


+\(+/[2]JPRODUCTION) 
204 439 576 712 <— Cumulative quarterly outputs for nation 1. 
157 332 405 528 <- Cumulative quarterly outputs for nation 2. 
153 342 427 590 <— Cumulative quarterly outputs for nation 3. 


Reading from the right, we have summed down the columns of each plane 
and then performed ‘‘sum-scan’’ on each of the three vectors created by 
the reduction down each column. 

Suppose that the second dimension represented classifications by types 
of oil rather than locations, and we wanted to compute the quarter-to-date 
sum for each of these five types across nations. 


+\(+/[1JPRODUCTION ) 
89 200 240 292 
146 296 328 415 
21 148 202 291 
132 233 319 424 


126 236 319 408 
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+/[1] PRODUCTION gives the totals across nations by type of oil and by 
quarter. +\ applied to the resulting matrix gives the cumulative sums. 
Thus 89 = (53 + 13 + 23), and 200 = (89) + (111) = (5 + 55 + 41). In order 
to give you a better appreciation of the flexibility of these operations, 
consider the following alternatives, which are all equivalent: 


A<+\(+/01JPRODUCTION ) 
Be+\C1)(4+/01 PRODUCTION ) 
C++\01](+#PRODUCTION) 
D<+\(+4PRODUCTION ) 
The equivalence of these four alternatives is verified by 


A=B 


de de oP 
1iai 


A problem which statisticians frequently encounter is that of selecting 
subsets of observations. Suppose that you have a list of tables of statis- 
tics; for example, you are looking at annual observations of GNP (gross 
national product) statistics for a series of countries. Now while the com- 
plete list may contain as much information of this type as you would ever 
wish to use, and so is potentially a very useful data source, in any particu- 
lar problem you may want to look at only a few variables, or a few observa- 
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tions, or only a selection of countries. Let’s see how we can select vari- 
ables from an array of several dimensions. 


Dyadic Use 


of Compression 


The compression function is the dyadic use of the reduction operator /. 
This operation requires a vector of 1’s or zeros to the left of the ‘‘slash.”’ 
Here is an example. 


1010/1257 
15 


What does it do? It ‘‘compresses’’ the vector 1 2 5 7 to 1 5, by dropping 
those elements from the right array which are matched by zeros on the left 
side of the compression symbol. This means that we must have the same 
number of elements on both sides of the./ symbol. If we violate this rule, 
we get a length error. 


101/135 7 
LENGTH ERROR 
1oit/1i135 7 
A 


If X is a three-dimensional vector defined by 


X*2 2 2 018 
Xx 

1 2 

3 4 

5 6 

7 8 


we can compress this matrix by using the same rules we developed for 
scan and reduction. Here are a few examples: 


0 1/X 
2 Compression according to the 
4 last coordinate, columns. 
6 
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0 1/f31x 
2 An alternative way of doing the 
4 same thing. 
6 
8 
O 14X 
5 6 Compression according to the 
78 first coordinate, planes. 
0 1/C1Ix 
5 6 An alternative way of doing the 
same thing. 
7 8 
O 1/{21x 
Compression according to rows, 
3.4 . . 
that is, according to the second 
7 8 coordinate. 


Let’s consider an example. Suppose that we have annual observations 
on GNP statistics listed as columns in a two-dimensional table, one table 
for each country. This mammoth variable is called GNPSTAT. You want to 
run a regression between, say, consumption and income, which are in 
columns 5 and 26 for countries for which the index numbers are 8, 9, 22, 
and 38. Also, you have decided to delete the war years from your analysis; 
these years have index numbers 40 to 45. Let’s store our subset of data 
into CYTRFAL. This could be achieved by the procedure outlined below. 

We need three arrays to compress GNPSTAT. Let’s label them CO for 
the country compression, VAR for the variable compression, and WAR for 
the war years. Suppose that there are N countries, M variables, and T 
years altogether. Then we type in 


VAR<M 00 
V4R(5 26]+ 1 1 
CON pO 
COL8 9 22 38] 11141 
WAR<T pot 
WARL4O+0,15]« 6 pO 
and now we can obtain the variable we want by 
REGVAR<CO/L1JWAR/L2)VAR/(C3 IGNPSTAT 


As a numerical example of this, consider the variable X above and 
suppose that M = N = T = 2. We want the first ‘“‘country,’’ the second 
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year, and the first variable; in this simple example that produces the num- 
ber 3. Type in 


VAR« 1 0 
CO+ 1 0 
WAR+ 0 1 
REGVAR*«CO/(1JWAR/[ 2 ]VAR/( 31x 


11.3. Expand Function 


Dyadic Use of 
Expand \ 


Deletion of variables naturally has its complement in the insertion of 
variables. Suppose for example that we have a matrix of data (i.e., a table 
of variable values) and we decide that we want to add some more variables 
to the array—not just tacked on as it were, but added into a specific place 
in the array. The way in which this can be done is by use of the dyadic 
expand function, \. 

The expand function is analogous to the compress function. It has the 
same general form. It requires an array of zeros and 1’s to the left of the 
expand operator (\). But with expand, the number of 1’s in the left vector 
must be equal to the number of elements in the coordinate of the array to be 
expanded. Whenever a zero occurs in the /eft array, a zero is placed in 
sequence in the expanded array between the elements of the array on the 
right. Here is an example: 


x 


10 1\X 


Expansion by columns (last 
coordinate); 0’s are placed 
506 between columns as indicated 
by the array 1 0 1. 


10 14x 


12 { Expansion by planes (first 
coordinate). 
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ales Ogee eZ 6 


12 Expansion by rows (second 
coordinate). 


0 0 
7 8 


As you can see, a zero in the left argument inserts a coordinate of zeros in 
the expanded array. 

If we follow up the GNPSTAT example in §11.3, we can see that if we 
wanted to insert GNP statistics for a whole country we would first make 
room for them by 


We Ar ae ge SOD OD es ces PNET IGN PSTAL 
We would make room for a new variable for all countries and years by 
Tt ot ee OO. ee wy od ANE GARE TAL 


In order to fill in years of observations originally left out of the data set we 
would enter 


Both Aa we aw TOMOKO eA ew LO ONT DICHESTAT 


In this last example we have made room for two rows of data to be added 
at the bottom of each table; in short, we have made room for two further 
years of observations when we manage to get them. 

Frequently, when we acquire a data set, the order of the variables or the 
arrangement of the tables of entries may not be convenient for our pur- 
poses. Consequently, it is often most useful to be able to reorder higher 
dimensional arrays—to rearrange them into a more suitable format. The 
next couple of functions help us to do just that. 


11.4 Reverse or Rotate Function 


Rotate © 


In its monadic form, ¢1 2 3 4 produces4 3 2 1. The character > is 
produced by overstriking the circle (upper shift alphabetic 0 ) and the res- 
idue | (upper shift /). If the vector on the right of ¢ is a multidimensional 
array, the function obeys the same rules that we have discussed concern- 
ing the expand, compress, scan, and reduce operators. The following ex- 
amples will illustrate the use of 9. 
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xX 
1 { Reverses the order of columns, 


the /ast coordinate. 
3 


5 


7 
twas column 1 


twas column 2 


o[ 31x 
1 An alternative way of doing the 
same thing. 
3 
5 
7 
OL 2)X 
R , 
4 <-was Row 2, Plane 1 everses the order of rows, the 


second coordinate. Notice that 

2 <was Row 1, Plane 1 | row reversal is within planes, 
not across planes. 

8 <was Row 2, Plane 2 


6 <was Row 1, Plane 2 
ol1dx 


6 <—was Plane 2 Reverses the order of planes, 
the first coordinate. 


8 


2 <was Plane 1 


4 


Ox 
6 An alternative way of doing the 
same thing. 
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In this last example, the overstriking of the circle with the subtraction 
sign instead of the residue sign indicates that the operation is to be over the 
first coordinate instead of the last. 

The two-argument or dyadic form of rotate allows you to specify the 
‘‘amount of rotation.’’ By this we mean that you can reposition the ele- 
ments of the array by specifying the rotation in the array to the right of 4. 
As an example, we rotate 4 2 | 6 two positions to the left by 


Symbolically we have: 
204 2 1 6 4216! Ist rotation step 2 1 6 4 


164 2 2164! 2nd rotation step | 6 4 2 
) 


4216 Produces 1 6 4 2 


We can rotate in the other direction by 


“304 21 6 Vy 216¢ 


2164 


Here we have rotation to the right by three spaces. Here is how that 
worked: 


~ 6 4216 
164 2 1 1 


6421 
“opu 2146. 6421 
| en 
1642 
~ 1642 
364 21 6 t 
2164 


Perhaps an easier way to see these operations is to consider putting the 
numbers down in a circle with positions marked, and then to rotate the 
inner ‘‘dial’’ of numbers. Thus 264 2 1 6 means rotate two positions to 
the left, and if you rotate the inner dial two positions to the left (counter- 
clockwise) you will get 1 in position 1, 6 in position 2, etc. 


(C) 
3 


If you find the clock approach handy, use it. 
In short, rotate merely rotates the array of numbers in sequence, to the 
right with a negative argument, and to the left with a positive argument. 
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This procedure is also called a cyclic shift, because although elements are 
rotated, they are not interchanged. 
Just as a reminder that ~1 and —1 are not the same, you might try 


-io4 216 
2 14 6 4 


Multidimensional arrays can be rotated along a specified coordinate. Or, 
you can indicate the ‘‘amount’’ of rotation for each row, column, or plane 
within a coordinate. For our example, we will use the three-dimensional 
array Z generated by 


22 34 9 124 


Z Row numbers 
12 3 4 (1) 
5 6 7 8 (2) 
9 10 11 12 (3) 
13 14 15 16 (1) 
17 18 19 20 (2) 
21. 22 23 24 (3) 


(1) (2) (3) (4) 


Column numbers 


262 Rotate columns (last 
coordinate) to the /eft 
by two positions. 


19 20 17 18 
23 24 214 22 


(3) (4) (1) (2) 


Column numbers 


Next, we rotate the second dimension (rows) one place to the ‘‘right’’ 
which, for rows, is down, and notice that the rotation of rows is within 
planes. (Rotation of columns is also within planes, but the distinction is not 
apparent.) 
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“1O{ 242 


9 10 11 12 
1 2 3 4 
Bee Gh oP 8 


21 22 23 24 
Tie SES: 16: 
17 18 19 20 


We can reverse the planes with any of the following commands: 


166112 
13 14 15 16 
17 18 19 20 


21 22 23 24 


1 2 3 4 


17 
2d 


7 


8 


11.12 
“16[142Z 


15 16 


19 


23 


20 
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17 18 19 20 


21 22 23 24 


oD 6 38 


You will gain much more familiarity with this function in the exercises at 
the end of the chapter. For now, it should be clear how you can rearrange 


Ly 


12 


Row numbers 


(3) 
() 
(2) 


(3) 
(1) 
(2) 


<— Plane (2) 


< Plane (1) 


<— Plane (2) 


<— Plane (1) 


<— Plane (2) 


<— Plane (1) 
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your data into a more convenient format. In matrix algebra the operation of 
trans position of matrices and vectors is the most useful of all matrix opera- 
tions. We now examine APL’s development of this basic, but simple, 
notion. 


11.5 Transpose Function 


Transpose ® 


The transpose function will alter the shape of your data matrix. The trans- 
pose symbol is constructed by overstriking the circle (above the alphabetic 
O) and the reduction operator \, forming &. The two-dimensional case is an 
exact analogue to the mathematical operation of transposition. For exam- 
ple, 


¥e2 3 p111222 


Y 
111 
22 2 
oY 
2 3 
Qy 
1 2 
1 2 
1 2 
p(@Y) 
3.2 


In mathematical notation & Y is written as Y’ or Y'. The three-dimensional 
array is sometimes confusing: 


QZ 


11 23 
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4 16 
8 20 
12 24 


What has been done here is best explained by considering the shapes of the 
two arrays. The shape of Z is 


pZ 

234 

but the shape of &Z is 
p(&Z) 

4 3 2 


So we see that we have here a natural extension of the idea of transpose; in 
APL terms, transpose yields an array whose dimensions are the reverse of 
the dimensions of the original matrix, i.e.,8Z 2 3 4 are the dimensions of 
QZ. 

The above discussion has shown you how the monadic form of trans- 
pose, 9, works. There is a very important extension of this idea in the 
dyadic use of 8. Remember that in the monadic use, if p(Z) was 2 3 4, 
0 (8Z) is 4 3 2. But what if we want an array of dimensions 3 2 4? We 
obtain such a result with the dyadic transpose and we refer to each dimen- 
sion number by its position. Thus in the first position for Z, the dimension 
number is 2, in the second it is 3, and in the third it is 4. If we type 


1 2 382 


5 6 7 8 
9 10 11 12 


13 14 15 16 
17 18 19 20 
21 22 23 24 


we get back Z itself. 

Now let us consider the more interesting alternatives. If we specify 
3 2 1 & Z, we will get an array of 4 planes of 3 by 2 matrices. An element 
in the [7;J;K] position in Z is put into the [K,J;J] position for the array 
(3 2.1 & Z). For example, 


3.2 182 


4.13 e(3 2182) is4 3 2. 
Elements in the [/;/;K] position 
are transposed to the [K;J/;J ] 

g 21 position. 
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2 14 
6 18 
10 22 
3 15 
7 19 
11 23 
4u 16 
8 20 
12 24 
1 3 2&2 
1 #5 9 
2 6 10 
3 7 11 
4 8 12 
13 17 21 p (1 3 2 & Z)is2 4 3. Elements 
tu 1g 29 in the [7;J;K] position are transposed 
to the [/;K,/] position. 
15 19 23 
16 20 24 
2 1 382 
1 2 3 4 
13 14 15 16 
5 6 7 8 0 (2 13 8 Z)is3 2 4. Elements 
in the {/,J;K] position are transposed 
17 18 19 20 to the [J;7;X] position. 
9 10 11 12 
21 22 23 24 
One way to visualize these changes is: 
Original Coordinate Values 23 4 
Original Index Position 1 2 3 
Transposed Index Position 2 1 3 
Transposed Coordinate Values 3.2 4 


These operations will be particularly important in certain matrix opera- 
tions needed later on in the text. For now, let us note only the more 


208 


Higher-Order Arrays 


obvious benefits of the transpose function and its extension to multiple- 
dimensioned arrays. 

First of all, a very common matrix multiplication needed in statistics is 
given in mathematical terms by 


X(X'X)X’ 


where X’ represents the transpose of X and X is an N X K matrix, N> K. 
The matrix product X(X'X)"\X’ is programmed in APL by 


ReX+.x( El C QX)+.xX)+. xQX 

Let’s try it with the matrix W defined in the following manner: 
Wee 201001414123 
ReW+.x ( AC QW)+.xW)+. xQW 


R 
0.64706 “9.41176 0.25529 0.658824 
“0.41176 0.35294 0.058824 0.23529 
0.23529 “0.058824 0.17647 0.29412 


0.058824 0.23529 0.29412 0.82353 


A more interesting example is the following. Suppose that we have time 
series data on some variables for different countries. The data are averaged 
by country. But what we would like to observe are the data arranged by 
year, rather than by country. That is, the original data contain a set of 
observations on a number of variables by year for each country. We want 
to rearrange the data so that for each year, we have observations across 
countries for each variable. Let’s give a simple example. 

Suppose that the matrix Z we created before represents two countries’ 
data for three years on four variables; Z has 2 planes, 3 rows, and 4 
columns. 


Z Years 
1 2 3 4 (1) 
5 6 7 8 (2) } Country 1 
9 10 11 12 (3) 
13 14 15 16 (1) 
—_- —- — — (2) Country 2 
--—-—-— @ 
, ft T fT 
1 2 3 4 
Variables 


For each year we want 3 planes, a table of entries representing observa- 
tions by country on each variable; that is 2 rows and 4 columns. This is 
achieved by 
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<— Current dimensions 


1 <— Current position 


3 
2 


<— Desired dimensions 
<— and positions 


mem Ne N Wo 
Wh Wp 


So we type 
2 1 3&2 
1 2 3 4 


13 14 15 16 


5 6 7 8 
17 18 19 20 


9 10 11 12 
21 22 23 24 


and get the desired result. 

Another important ‘‘data manipulation’”’ task is to ‘‘combine’’ matrices 
and vectors to form bigger matrices; for example, we might have a matrix 
X of dimension N x M and another matrix Z of dimension N x Q, and want 
to form a new matrix W = [X Z] of dimension N x (M + Q). The next set 
of functions enables us to do just that and much, much more. 


11.6 Ravel, Catenate, Laminate 


66 93 


Ravel > The ravel function ‘‘;’’ can convert a multidimensional array into a list. 
Catenate , Consider the array DATA: 
Laminate ; 
DATA<2 3 4 910+124 
DATA 
11 142 13 14 
15 16 17 18 


19 20 21 22 


23 24H 25 26 
27 28 29 30 
31 32 33 34 
If we ravel DATA we get: 
»DATA 
11 12 13 14 15 16 17 18 19 20 21 22 23 2H 25 26 27 28 29 30 31 32 33 34 
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where we have DATA listed by rows in the first plane and then in the second. 
Essentially, the ravel operation (like all the others that we have discussed) 
operates according to columns within rows within planes, within blocks, 
etc. You can check the shape of DATA and of , DATA by 


oDATA 


e(,DATA) 
24 


To compute an arithmetic average of all the elements of the DATA matrix, 
you could write 


(+/,DATA)+ p ,DATA 
22.5 


The ravel function is monadic. The dyadic form of the function is called 
catenate or laminate. The general form of the command is X,[ Y]Z. When Y 
is an integer, the function is called catenate and when Y is a fraction, the 
function is called laminate. Let’s begin with catenate when Y is an integer. 

Catenation ‘‘joins together,’’ as it were, two arrays along a specified 
coordinate, provided that the lengths of the arrays involved are the same, 
that is, provided that the two arrays are conformable. If this is not the case, 
you get a length error. The operation can best be explained by examples. 

Let us define the following arrays: 


A B C D 
1 2 3 8 9 20 21 22 25 
4 5 6 10 11 
pA oB pc oD 
2 3 2 2 3 
Now try 
A,(21B B,L2dA ‘eee here augments 
1 2 3 8 9 8 9 +t 2 8 the columns 
4 5 6 10 11 10 141 4 #5 6 
eee CT 
A B B A 
9A,L2IB oB,L2)A 
2 5 2 =5 


Catenation along columns means column dimensions are added, 3 + 2 = 5. 
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A,B B,A 
1 2 8 9 8 9 1 2 
{Same result as above 
4 5 6 10 11 10 14 4 5 6 


We notice here that, as we might have suspected, if no coordinate is 
specified, catenation, if conformable, proceeds along the /ast coordinate. 
Now try 


A,lil¢ C,L1]A 
1 2 3}A 20 21 22}C ae 
Catenation in this case 
4 5 6 1 2 3 augments rows 
C A 
20 21 22 4 5 6 
So row dimensions are 
Ci,L1JIA 
pA .Ca Ie pest { eadea, 2s 1=3 
3.3 3 3 
A,C1J1D A,D 
1 2 3 14 2 3 25 A scalar is extended to 
make up an array of appro- 
4 9 6 4 5 8 25 priate dimension for cate- 
25 25 25 nation. 
04,[1]D oA,D 
3.3 2 4 


We see here that scalars can be catenated to arrays and are automatically 
extended for this purpose. 

Examples of data manipulation using the function catenate spring to 
mind. Imagine that you have an array of time series data on a set of 
variables, and that later you acquire data on a further few years of observa- 
tion on the same variables. Catenate enables you to add the new data to 
your old array quickly and efficiently. Another useful example to keep in 
mind is the perennial problem of handling the constant term in a multiple 
regression problem. Suppose once again that you have an array X of data 
on variables to explain the movements in steel prices over time, but that 
before running your regression you want to add the constant term; i.e., you 
want acolumn of 1’s as the first column of your data matrix. Now we know 
how to do this without pausing for further thought: 


X1,[21X 
gives the desired data array. Try this with our simple A matrix: 


A<1,[2]A 
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Lamination , Laminate is quite different from catenate in operation because a new 
coordinate is established. The form of the command is identical to that for 
catenation, specifically X,[Y]Z. But now Y can take on any positive frac- 
tional value. The idea of laminate is to create a new dimension. For exam- 
ple, suppose that you have two matrices of dimension 3 x 4 to be lami- 
nated. You might want 2 planes of 3 x 4 matrices, 3 planes of 2 x 4 
matrices, or 3 planes of 4 xX 2 matrices. Laminate lets you choose. Con- 
sider the two matrices E and F: 


E PF 
A. 2 3 ath 2 3 4 
5P 465 2° 8 25 6 7 .8 
9 10 11 12 79s <1a0.° yt 12 
Gee Loe HF [512 
G A 
12 3 4 od. 2 rac 4 


G, H are each 3- dimen- 
5 6 7 8 5 .6 mer, .8 sional arrays; 2 
planes of 3 by 4 matrices 


9 10 11 12 69> 405 Lash, a2 
wh, e2 we a4 1 2 3 4 
Dw Owl 28 BG ae 8 
JOD ted de2 9 10 11 12 
oG pH 
a ee: 2 3 4 Ga added dimension is 
A A indicated by the + 


In short, specifying L.5]# means that we want to add a dimension in 
‘front.’ Actually, any decimal between 0 and 1.0 would do as well as .5. 
Now, what about something like [1.6]”? Here we want to add a dimen- 
sion between the existing first and second dimensions. This operation will 
give us 3 planes of 4 x 2 matrices. Consider the examples: 


E,{1.61F E,L2.61F 

a 2 3 4 1 wh 
ot 2 3 4 2 22 
5 6 7 8 3 3 

5 6 7 8 4 4 

9 10 11 12 5 5 
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o£,[1.61F 7 eh 

3 2 4 8 ete. 

aN g ) 
10 a 

11 4 

12 1.2 

o#,12.6]F 
sh 2 fee added dimension 
r is indicated by the + 


These examples show us how we can fit two data matrices together in a 
variety of ways. For example, the first case of creating 2 planes of 3 by 4 
matrices might be useful in setting up an array of time series data on a 
group of variables by country. Alternatively, E and F might represent, 
respectively, the labor and capital inputs by year for various production 
plants. In the #,[1.6]F case, each of the three planes created contains 
each year’s labor and capital inputs to the various plants. In the #,[2.6]F 
case, each of the three planes contains the transpose of these data mat- 
rices. 

Let’s complete this chapter by examining the ways in which we can do 
the opposite to the above operations; that is, we have just been discussing 
fitting matrices together into higher-dimensional arrays, now let us see how 
to ‘‘undo”’ that process. 

The functions of ‘‘drop’’ and ‘‘take’’ enable us to select in a straight- 
forward manner from a multidimensional array those elements that we 
wish to use. 


11.7. Take and Drop Functions 


Take + and Drop + 
Functions 


It is possible to extract or delete some of the elements of a multidimen- 
sional array by using the take and drop functions. These functions, like the 
others we have described, will operate on character matrices as well as on 
numeric ones. 

The take function, +, which is upper case Y, extracts elements of an 
array. It has the format A+”, where A elements are extracted from B, 
starting on the /Jeft. For example 


2*"ABCDE' 
AB 
but 

“34 'tABCDE! 
CDE 


which takes three elements from the right. 
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Now consider the three-dimensional array 
B<2 2 3 p'ABCDEFGHISKL' 
B 


122¢8 


DE 


takes the first plane from the first dimension, the first two rows from the 
second dimension, and the first two columns from the third dimension. 

The drop function, +, removes elements from an array or list, just as the 
take function extracts elements. The down arrow ¥ is above the character 
YU on the keyboard. 


3 ¥ 'ABCDEF* 
drops the first three elements ABC, and we get 
DEF 
The code 
“3 ¥ '+/%*A,?6! 
results in 
+/*A 


As you can see, characters other than alphabetic or numeric ones can be 
operated on. The drop function also operates on multidimensional arrays, 
just as the take function does. For example, 


B 
ABC 
DEF 
GHT 
JKL 
{ Drops the first plane, keeps all rows 
102+8 ? 
and drops the first two columns. 
I 
£L 
Drops the first plane, first rows, 
Pi2zy+s and first two columns 


Summary 


Summary 215 


L 
Bg og oe the last plane, the last row, 
and the last column 
AB 
474604 8 : Drops the first plane, last row, and 
keeps all columns 
GHI 
ae he ae nothing, i.e., keeps 
everything 
ABC 
DEF 
GHI 
JKL 


Reduction, /, and Scan, \: across multidimensional arrays oper- 
ates as with lists, but over the Jast coordinate. Overstriking / or \ with - 
(minus sign) yields reduction or scan over the first coordinate. Reduction 
or scan over any coordinate can be chosen by specifying the chosen coor- 
dinate by indexing. Thus if, for example, 0A is 2 3 4 we have: 


+/A is equivalent to +/[3]4, reduction by columns 
+7A is equivalent to +/[.1J4, reduction by planes 
+/C2I4 yields reduction by rows 


Similar remarks hold for the scan function. 


Compression, dyadic use of /: IfA isa list of 0’s and 1’s, andB is 
a list, then 4/B produces a list C of dimension (+/4) which contains those 
elements of B indicated by the 1’s in A. Compression can be used with 
higher-dimensional arrays in exactly the same way as can reduction or 
scan, 

Expansion, dyadic use of \: IfA is alist of 0’s and 1’s, andB isa 
list, then 4\B produces a list C of dimension (4). Then +/4 must equal p28. 
C will contain the elements of B with zeros inserted in the positions corre- 
sponding to the zeros in A. Expansion with multiple-dimensional arrays 
expands the size of the indicated dimension by the number of zeros which 
appear. For example, if pX is 2 2 2: 


1 0 1\Xis equivalent to1 0 1\[3]Xand produces a middle col- 
umn of zeros in each matrix in each plane. 


101\X is equivalent to 1 0 1\[1]X and produces a matrix of zeros 
between the two planes of matrices of X. 
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1 1 0\{€2]X% produces a row of zeros at the end of each matrix in 
each plane. 


Rotate, 6 (keyed by upper shift 0, backspace, upper shift ¥), a 
monadic function: Rotate reverses the order along the indicated dimen- 
sion. For example, 6X reverses the order of columns in X, an array of 
dimension 2 2 2. 


o[2]X reverses the order of rows in the array X. 


o£1]X, which reverses the order of planes, is equivalent to ex 
(keyed by upper shift 0, backspace, MINUS). 


Rotate, ¢, dyadic form: The amount and direction of rotation can 
be specified in the dyadic use of 9. 


2 (list) rotates a list two positions to the /eft. 
“3 (list) rotates a list three positions to the right. 


Dyadic rotate can be used in a manner analogous to the use of scan and 
reduction with higher-dimensional arrays. 


eX is equivalent to $[1]X. 


Transpose, & (keyed by upper shift 0, backspace,\ ), a monadic 
function: § applied to an array with dimensional elements [J/; J; K] yields 
an array with dimensional elements [K,J,/]. 


Transpose, &, dyadic form: If A is an integer array, AX trans- 
poses the elements of X as specified by the elements of A. For example, if 
X has dimensions 23 4, then 2 31 8 X produces an array with 3 planes, 4 
rows, and 2 columns. 


Ravel, ‘‘,’’ a monadic function: converts any array into a list by 
listing the elements along the dimensions, starting with the last and work- 
ing forward to the first. 
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Catenate (or Laminate), ‘*,’’ the dyadic form of ravel: The func- 
tion is used by Y, [41X, where A is the specified dimensional ‘‘index’’ of a 
multidimensional array X. If A contains integers, the function is called 
catenate; if A contains nonintegers, the function is called laminate. With 
catenation, Y and X are joined together to form a larger array whose 
dimension along that indicated by A is the sum of the corresponding values 
for Y and X. The arrays must be conformable. With lamination a new 
dimension is created (see page 212). 

Take, + , (keyed by upper shift Y): dyadic function which extracts 
elements from a list or array. If B has dimensions 2 3 4 ,12 3+ 8 
produces an array from the first plane of B, the first two rows, and first 
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three columns of B. Similarly “1 2 ~3 + B produces an array from the last 
plane of B, the first two rows, and last three columns of B. 


Drop, ¥ , (keyed by upper shift /): dyadic function used in a man- 


ner analogous to take, +, but drops (or deletes) elements from the array 
instead of taking them. What is obtained is what remains after the indicated 
elements have been deleted from the array. 


APL Practice 


1. 


3. 


Let X+2 4 4p 32 ?32. That is, X consists of two planes of 4 by 4 
matrices each. Perform the following operations and examine the 
results with a view to understanding how to handle multidimensional 
arrays. 


(a) +/X (m) 0 0 1 4X 

(b) +7X (n) 5 3 2 +X 

(c) +/[11X (o) 1144111 4X 
(d) +/[2]xX (p) +\+/X 

(e) +/L31X (q) 0 0 O¥0 0 O tX 
(f) -/£11% (r) +/ 140470014 4% 
(g) -/X (s) 1001/£21X 
(h) +/C1IX0233] (t) 1010 /03]X 
(i) +/+\X (u) (8p01)\£2IX 

(j) +/+\011X (v) 1,X 

(k) +/+\[21X (Ww) 1,0,1,X% 
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Use the matrix X of exercise 1 to examine the following APL opera- 
tions. Before carrying out each operation, try to predict the result. 


(a) 26x (j) 21 3 ex 
(b) 0ox (k) 31 2 & 
(c) 1x (I) $,X 

(d) ~1o6[11X (m) 1 0 $,X 
(e) 16f21x (n) 0X,£2]x 
(f) “10 31x (0) (£21X) ox 
(g) ox (p) 1.x 

(h) &X (q) X,1 

(i) 12 3 & (r) X31 


Use the matrix X of exercise 1 to examine the difference between: 
(a) 1 1 14% and 1 1 2 tx 
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(b)0 00 4X and 234 4X 


(c) +/L1Ix and (1 3 4 +X¥)+1 00 4X 
(d) X0323] and 2 3 1 +10Xx 
(e) ox and $L1]X 


4. Use the matrix X of exercise 1 to: 
(a) Compute the left inverse of X. 
(b) Insert a row of zeros between the 3rd and 4th row of each block. 
(c) Insert two blocks of zeros between the two blocks of X. 
(d) Select the second block with a column of 1’s added in the begin- 
ning. 
(e) Form a block-diagonal matrix consisting of the two blocks of X. 


(f) Form a matrix W of dimension 3 x 4 consisting of the Ist and 3rd 
rows of the first block of X and the 2nd row of the second block of 
Xx, 


5. Define a matrix A with elements 
(—1)? G2) i<j 
ay =) (-D"! i=j j=1...10 
0 i> 
where (/=}) is the binomial coefficient. Verify that A? = J. The 
matrix A is said to be orthonormal; it is its own inverse! 


6. For the matrix X of exercise 1, use the logical function, =, to 
show that 
ae a _ coe "] 
0 X[2;;3] ~ LO A [srt 
A matrix such as 
le (153] 7 
0 X{2;;] 


is called block diagonal, where X [1;;] and X [2;;] are the blocks. 


7. Consider the square matrix 


1 2 3 
A=j15 1 5 
4 8 12 


Use the computer to verify : 

(a) That A is singular (i.e., has no inverse). 
(b) That the matrix e4 is not singular. 

(c) That C = A + B, where 
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10. 


0 0 0 
B={0 0 0 
0 0 0.1 


is nonsingular. 

(d) That (7 + B)A yields a matrix identical to A except that the 
3rd row has been replaced by the third row plus 0.1 times the 
3rd column, while A(J + B), where J is the identity matrix, 
has a similar effect upon columns. 


ForA =[} $9] and B = [3 %] show that AB = BA using the logical 
function ‘=’ 


Given the matrices: 
—~| -] 
Y= [ 1 0 X= LXijdvxx 


where X;; = (—1)* (7), (4) is the binomial coefficient, NM = 10, 
j=il,...,10,i=1,... , 10, find the limits of 

(a) lim, .. ¥” 

(b) lim, X” 


Assume that the array named ALL consists of 12 matrices, one for each 
of the 12 cities in which a publishing company sells some of its 
magazines. Assume further that each matrix consists of five rows that 
represent locations within the city, and that each row has 16 elements 
representing the number of magazines sold per month by each of the 
16 newsstands in each location. How would you write in APL a pro- 
gram to determine: 

(a) The total number of magazines sold in each city during the 

month. 
(b) The newsstand with the maximum sales volume in each city. 


(c) The total volume of sales. 


(d) Let P be the row of the 12 different prices charged, one for each 
city. Show how you would find the city with the maximum sales 
in dollars. 


Statistical Applications 


Data for these applications are in appendix E. 


1. 


Find estimates of the coefficients of the regression 
I, = ayo + a, RL, + 


where J, = level of investment in the U.S. for the years 1950 to 1978, 
and RL, = the long-term interest rate for the U.S. for the same 29-year 
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period. Recall that if you use REGRESS, the data arrays must be lists, 
not two-dimensional arrays. 


2. Suppose you wanted to rerun the regression of exercise 1, 
I= a) + a,RL, + uw, using the information (which somehow you 
managed to get) that the true variance of u, is 225. How will this new 
information change your regression coefficient estimators and esti- 
mates? How about their estimated standard errors? 


12.1 Inner Product: 


Inner Product 
+.X 
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Inner and Outer 
Products—Matrix 
Manipulation 


The previous chapter gave us a variety of methods for rearranging, select- 
ing, and building up multidimensional data arrays. But multidimensional 
arrays also enable us to perform with ease a number of complicated statis- 
tical computations. For example, we frequently require the sum of prod- 
ucts of observations on two random variables. Another frequent require- 
ment is the preparation of tables and the need to perform some operation 
on all the elements of an array with each element of another array. The 
functions below facilitate these operations. 


Some New Ideas 


You have already seen the inner product in connection with matrix multi- 
plication. In that case two matrices, say A and B, were operated upon using 
the three symbols that form what is sometimes called the ‘‘plus-times’’ 
inner product. We form two matrices 


A<2 2018 
Be2 204418 
A 
1 2 
3.4 
B 
5 6 
7 8 
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The plus-times inner product is written as 
At+.xB 

19.22 

43 50 


To review, the (1, 1) entry is formed by (1 x 5) + (2 x 7) = 19; the (1, 2) 
entry by (1 x 6) + (2 x 8) = 22; (2, 1) by (3 x 5) + (4 X 7) = 43; and (2, 2) is 
computed by (3 x 6) + (4 x 8) = 50. We have, in effect, placed the 
multiplication symbol between the elements ‘“‘across’”’ the second coordi- 
nate of A and down the first coordinate of B. Then the summation sign is 
inserted between these product pairs. 

Suppose that we reversed the + and x signs? We would have 


AX.+B 
54 70 ~ 
88 108 


The (1, 1) element is computed by (1 + 5) X (2 + 7) = 54, and the (1, 2) 
element is (1 + 6) x (2 + 8) = 70. So you can see that the ‘‘inner product’’ 
is more powerful than you might have expected. Another example is 
maximum-times inner product: 


Al .xB 
14 16 
28 32 


The first element is computed by (1x5)[(2x7)=14. Another function 
might be called the max-min inner product: 


Al .LB 
Pay) 
44 


where the first element (1, 1) is computed by (1 | 5) f (2 L 7)=2. For 
example, suppose that A and B represent expenditures on a group of com- 
modities over time by two individuals. You wish to discover what was the 
largest of the minimum expenditures between the two. 

These operations can be performed on higher-order arrays of different 
dimensions. For example: 


A«2 2 2 0i8 
B<2 2 o1 
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5 6 
7 8 
B 
1 2 
ple 
A+.x*B 
gg 
7 7 
11 11 
15 15 


The shape of the result of this command is 2 2 2. The shape always drops 
the /ast coordinate of the array on the /eft and the first coordinate of the 
array on the right. The ‘‘dropped’’ coordinates must be equal to each 
other—this is called conformability of the two arrays. Just as before, the 
multiplication symbol is placed between the elements of the last coordinate 
of A and the first of B. 


axp+2Q2xN)= 3 Uxh+(2x1 
B3x1l+4xD)= 7 BxiI+(4x1 
SxD+6xXD=11 (5X14+(6X1 
(7x 1) + (8 x I) = 15 (7X 1+(8x)=1 


Another operation is illustrated by the following example. If we wanted 
to obtain the matrix product of the matrix in plane one of A with that of the 
matrix in plane two of A, we would key in 


)= 3 
= 9 
)=11 

5 


Ali33J+.xAl2;33] 
19 22 
43 50 


This is exactly what we obtained in the first example of this section. If the 
two planes represent observations on two vectors of random variables, 
each observed over time, then the above operation produces the raw mo- 
ment matrix of the random variables. Thus the typical (i, /) element of the 
inner product is mathematically 2,x;ix,yj,, where x;, and y,, are the kth 
observations on x; and y,, the ‘th and jth elements, respectively, in the x 
and y vectors of random variables. 


12.2 Outer Product 


Outer Product This instruction enables you to place a primitive dyadic function between 
o, ng each element in the corresponding positions in the two arrays. (Is that a 
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mouthful of words! And only a few weeks ago you didn’t even know that 
APL means A Programming Language!) This function is constructed with 
two symbols, the null or jot (the little ° above the J)and the period. Its 
general form is 


Ao. fn B 


where A and B can be either scalars or matrices. o. is called ‘‘jot dot.”’ 
One use that is made of this instruction is preparing tables. Here is an 
example of a simple arithmetic table for addition. 


123 4 5°.401234 5 
123456 
234567 
345678 
456789 
56789 10 


The first row is computed by adding to one (the first element on the left) all 
the elements on the right of the outer product function. The second row is 
formed by adding the second element of the left argument to each element 
of the right argument. Here is another way to visualize the operation: 


Notice that we have written in the column and row headings and the 
addition symbol. Let’s try another simple arithmetic table, only this time 
let B be a three-dimensional array. 


B+2 2 2p 4+18 


B 
(ae 2 
L 0 
fi. 32 
3 4 
C+O 1 20.xB 


C 
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0 0 
¢) ¢) 
Block 1, given by 0 x B 
0 0 
0 9) 
“3. 2 
“4 0 
Block 2, given by 1 x B 
1 2 
3 4 
“6 4 
"2 0 ; 
Block 3, given by 2 x B 
2 4 
6 8 
ec 
3 2 2 2 


Notice that the shape that results from the outer product is the catenation 
of the shapes of the arrays A and 8, 9 (A) = 3, P (B) =222. C has3 
blocks of 2 planes of 2 rows by 2 columns. The arithmetic is just as easy as 
it is in the simple case. The scalar zero is multiplied by every element of 
the three-dimensional array B, then | x B is formed, and then 2 x B. Feel 
free to experiment with other dyadic operators or combinations of 
operators. However, be aware of how much storage, printing, and com- 
puter time you are using, since this is a ‘‘super-powerful’’ command. 
Consider how many values 4°.*B would generate if the shapes of A and B 
were each (5 § 5 5). 


12.3. An Economic Example (Production Functions) 


An interesting use of outer product concerns production functions. Sup- 
pose that we have estimated the parameters of a production function and 
we want to examine the properties of the estimated function. We start with 
a modified version of a Cobb-Douglas production function: 


Y, = S5L¢KRevie" 


and our estimates of the parameters could have been 


Y, = 120.0 L?°K)5e3% 

Y, might be the amount of oil pumped from various drilling platforms, L; 
equals the amount of labor at each platform, K,; is the amount of capital 
services at each of the platforms, and D, is a binary variable that takes on 
the value | for offshore rigs and 0 for onshore rigs. Y is the estimate of the 
conditional mean of Y;. The Greek letters are-symbols for the parameters 
that were estimated, and U; is a disturbance term distributed normally with 
zero mean and constant variance. 
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Our objective is to explain the economic implications of this equation. 


We write an APL program using the outer product command. 


VPROD 


[1] Y120x(*0.3xD)o.x(Z*0.5)0.x(Kx0.5) 


[2] Vv 


Working from right to left we raise K to the 0.5 power. Next perform outer 
product multiplication with L raised to the 0.5 power. This two- 
dimensional result is combined with e°*” using the outer product multiply 
again. The final three-dimensional array is multiplied by 120. 
Notice that we did not code the equation exactly as it was first written. 
Since the first dimension of the result is the coordinate of the /eft-most 
variable, we put D on the left. This allows us to see most easily the effect of 
drilling location on productivity. We generate some hypothetical data: 


[I«i10 
K#15 


168.7056275 
207. 8460969 


268.3281573 
293.9387691 
317.4901573 


339.411255 


3739.4733192 
161.9830569 
229.078636 

280.562845 

323.9661138 
362.2051265 
396.7758364 


169.7056275 
240 
293.9387691 
339.411255 
379 .4733192 
415.6921938 
448.9988864 
480 
509.1168825 
536.6563146 
229 .078636 
323.9661138 
396.7758364 
458.1572719 
512.2354022 
561.1257691 


207. 8460969 
293.9387691 
360 
415.69219 38 
464.7580015 
509.1168825 
5439.9090834 
587.8775383 
623.5382907 
657.267069 
280.5628845 
396.7758364 
485.9431707 
561.1257691 
627.3576818 
687.2359079 


~411255 
- 6921938 


- 6563146 
8775383 
-9803147 


- 8225099 


~9466384 
- 9661138 
-1572719 
1257691 
~9322276 
~4102529 


5516728 


268.3281573 
379.4733192 
464.7580015 


536.6563146 


657. 267069 
709.929574 
758.9466384 
804.9844719 
848.5281374 
362.2051265 
512.2354022 
627.3576818 
724.4102529 
809.9152845 
887.217742 
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428.5668852 606.0851014 742.2996196 857.1337704 958.3046882 
458.1572719 647,.9322276 793.5516728 916.3145438 1024.470804 
485.9491707 687.2359079 841.6886536 971.8983415 1086.615379 
512.2354022 724.4102529 887.217742 1024.470804 1145.39318 


The array Y has too many digits in each number for us easily to see what 
is happening. Let’s get an array of approximations to Y that will be easier 
to read. Let’s try 


YI<LY 

YLT 
120 169 - - - 
168 240 - - - 
207 293 - - - 


240 - 


This array has the following form: 


— Increasing K 


On-shore increasing 


First Table Entries 


Increasing K 
Increasing 
L 


Second Table Entries 


Off-shore 


You might also want to see the effect that an increase in the output elastic- 
ity of capital from 0.5 to 0.7 would have on output with various combina- 
tions of inputs. Consider 


VPROD[1] 
[14] [410] 
[1] Y¥<120x(*0.3xD)o.x(L*0.5)°.x(K*0.5) 
[1] Y«120x(*0.3xD)o.x(L*0.5)0.x(K*0.7) 
[2] Vv 
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120 194 258 316 370 


161 263 349 427 499 


323 526 699 854 999 
362 588 781 955 1117 
396 644 856 1047 1224 
428 696 924 1130 1322 
458 744 988 1209 1413 
485 789 1048 1282 1499 
512 832 1105 1351 1580 


These data can be plotted to give a clearer picture of the results. 

You have seen to some extent how higher-order arrays can be used in 
APL to simplify your statistical calculations. However, it is easy to be- 
come confused at first. It will help you to avoid confusion if you experi- 
ment with small samples of data and check your work at each stage. We 
have kept the examples small so that they were relatively easy to check by 
hand. The last example on the production function was more ambitious, 
but now you are at a point where you can probably use APL to check your 
APL. Applications that are even more ambitious, but more useful, will be 
discussed next. 
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12.4 Two More Not-So-Elementary Matrix Operations (Kronecker 
Product, Determinant) 


Kronecker 
Product 


* Kronecker Product 


Later on, if you continue to study econometrics and statistics, you will 
find a great need for a mathematical operation called the Kronecker prod- 
uct. The Kronecker product of two matrices > and W is written mathemat- 
ically as & @ W, where & and W are square matrices of dimensions (n X n) 
and (m X m), respectively. The result, say C, is of dimension (nm x nm), 
and is defined by 


OnW o2.W ---: o1,W 

O2W ox2W +--+ oo,W 
C=l@QW=] | 

On W On2.W se OnnW 


where > and W are the matrices 


On Dig °°" Gin Wit st Wim 

O21 O22 *** Gan Wei “Wan 
== W = 

Ont Onn °° * Onn Wm “°° Wim 


and o;;W represents scalar multiplication of the matrix W by the scalar o,,. 

This may at first glance look like a complicated product to obtain, but by 
now you know that in APL the required computation will be easy. We use 
the outer product introduced earlier in this chapter. 

Consider first the straightforward multiplicative outer product Ao.x8, 
which multiplies each element of the matrix A by each element of matrix B. 
So far so good. But the dimensions of the result are not exactly what are 
wanted. If D is the result of 4°.x28, where A is (n X n) and B is (m X m), 
then D has dimensions (nn m m), that is, n blocks ofn planes of m rows and 
m columns, or we have n-squared (m xX m) matrices. You may recall that 
the shape of Ae.~2 is the catenation of the shapes of A and of B. But we 
want an (mn X mn) matrix. The desired rearrangement of the result can be 
achieved by using reshape and the dyadic transpose, & (key upper shift 0, 
backspace, Key \). 

As you will recall, the monadic transpose, when applied to a matrix, 
transposes rows and columns, but what about this strange beast of a mul- 
tidimensional array D? QD would merely give us m blocks of m planes of 
n X n matrices; the order of dimensions is merely reversed. Thus, to get 
what we want, we need to do two things: rearrange blocks, planes, rows, 
and columns so that then we can reshape the result to get an (mn X mn) 
matrix. 
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Let’s consider the problem of getting A < B, where 


A -(! 2) B  |124 
(2x2) \3 4 (3x3) |1 3 3 
14 2 


First, let’s see what the outer product gives us (assign values to A and 
B first!): 


1 2 4 
1 3 3 this is equivalent to a,,B 
1 4 2 
Block 1 
2 4 8 
2 6 6 this is equivalent to a,.B 
2 8 4 
3 6 12 
3 9 9 this is equivalent to a.,B 
3 12 6 
Block 2 
4 8 16 
4 12 4 this is equivalent to a.B 
4 16 8 


If we try to reshape the above as it stands, we get the wrong results; try 
it and see. The solution we want is obtained by use of the dyadic transpose. 
Our solution is given by recalling that if we label the positions in A°.xZ by 
I; J; K; L], we want to rearrange them so that we get instead [/; K; J; L], to 
get 2 blocks of 3 planes of 2 rows by 3 columns, which can now be re- 
shaped into a (6 X 6) matrix. Consider then 


1 3 2 4 QD 


If you type this into the computer and look at the output, you will observe 
that raveling the result (which proceeds by raveling the first matrix in the 
first plane in the first block, then the second matrix in the first plane in the 
first block), you will get an array that can be reshaped into that required for 
the Kronecker product. Let’s see it. 


6 6 p13 24 QB 
1 2 4 2 2 4 8 


1 3 3 2 6 6 
1 4 2 2 8 4 
3. 6 12 4 8 16 
3.9 #9 4 12 12 


Determinant 
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The dotted lines have been inserted to aid you in relating the output to the 
definition of a Kronecker product. 

Dyadic transpose gives us another advantage—an easy way to get the 
diagonal elements of a matrix. Let K be defined by 


K*6 6 01 324 OD 
Try 


11 Qk 
132412 8 


In short, using 11 is how to get the diagonal elements of a matrix. 
The trace of a matrix is the sum of its diagonal elements, so that in APL 
the trace is 


+/1 1 OK 
30 


Determinant 


The next matrix operation you may need is the determinant, written as 
|A]. The determinant of a square matrix A of dimension (n x n) can be 
written as a linear function of determinants of submatrices of A of dimen- 
sion (nw — 1) X (nm — 1); in short, the determinant can be defined for n = 1 
by A itself. Forn = 2, |Alis given by (a,,4@22 — @1242,). For larger n, if we let 
Ay = (-1)'*D,, where Dj; is the determinant of the matrix obtained from 
A by deleting the ith row andjth column, and let aj; be the (i, /) th element 
of A, then 


|A| = TLiaydy 


for any i = 1,2,...,. This is the usual expansion by minors, which we 
can program quite easily in APL. Let’s begin by writing things out using 
paper and pencil. 

We see that the routine for calculating the determinant can be decom- 
posed into two main components: the calculation of the determinant of a 
2 < 2 matrix, and the recursive definition of a determinant of a matrix of 
dimension n in terms of determinants of matrices of dimension (n — 1). So 
the basic elements of the routine are these two components, plus a decision 
component that enables the computer to know which of the other two 
components to calculate. 

The determinant of a 2 < 2 matrix is easily programmed. Consider 


De(x/1 1 Q4)-AD1;2]xAT251]. 
D 


2 


which is nothing more than the APL version of (@,,;@22. — @,:d2;). You will 
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recall that 1 1 & A gives the diagonal elements of A, so adding x/ yields 
the product of the diagonal elements. 

If E is a vector containing the signs associated with the minors D,,, that 
is, E is a vector with elements | —1 1 —1.. . generated by (—1)'*’ for 
jJ=1,2,..., n, then the recursive definition is 


D+A[T1;] +.* (Ex™M) 
which is the APL version of 
D}2a,(—-1)'"D,; = Dray jAi; 


Looping 


We note from our definitions that the vector M has to be created element 
by element by getting the determinant of various submatrices. In short, to 
evaluate the function determinant, our function has to call itself! Fortu- 
nately, in APL this causes no difficulties, provided that it is done cor- 
rectly. We also see that we have to do the same thing (m — 1) times (for an 
n-dimensional matrix) each time we call the function; this is called ‘‘loop- 


ing.”’ 
In order to define /@, we have to calculate the determinants of the 
matrices A,,,Aj2,. . . »Ai,, where A is the matrix which is the argument of 


our function. A,;, you will recall, is obtained from A by deleting the first 
row and the jth column. To do this in APL, we will need to use the indexing 
of arrays and the ‘‘drop,’’ +, operation (key upper shift JU ). Another 
operation we need in this routine is ‘“‘compression,’’ which is the dyadic 
use of /. 

In our problem we want to drop only one row (the first) and one column, 
but a different one each time; this is where the indexing comes in. If we 
rearrange the matrix A so that the column to be dropped is first, and if we 
keep everything else in the same order, you can easily see (at least by using 
a paper and pencil to try it) that this will give us the submatrices we want. 
Consider the following: 


N+1 0/ pA {We need to know how “‘big’’ A is. 
IND@.N { gives us the index numbers 1, 
) ERASE B 2,...,N. 
B+(NV,N) pO 
B 
0 0 We need a ‘‘dummy’’ matrix for 
reindexing. 
0 0 


T+1 {Initializes our index. 

Produces an index array 
{with the /th element deleted. 
MeM, DET 1 1 ¥BL;INDI<AL;I,IN] 


IN«(I#IND) /IND 


12.4. Two More Not-So-Elementary Matrix Operations 233 


‘fhe last line needs some discussion. First, let’s call our determinant func- 
tion DET and let it have one argument, the matrix whose determinant we 
are trying to obtain. Also, by writing M¢«”,DET (argument), we create a list 
M with determinants as its elements by catenation. 


BL3;INDI]+AL32,IN] 


rearranges the columns of A so that the column to be deleted, indexed by /, 
occurs first in B, and the remaining columns are unchanged in their relative 
positions. 114B deletes the first row and column of B, and DET takes the 
determinant; in short, we get the (1, J) minor by this line. 

We are nearly done—the next most important step is to complete the 
looping through the index / from 1 to N. This is done as follows: 


I+1 {Initializes the loop. 


Gets us out of the loop 
porno cen Cie) foe I exceeds N. 


IN<«(I#IND) /IND 

MeM,DET 1 1 ¥BL;INDI<AL;I,IN] 

BE, (—1*(44+I)) {Produces the vector of signs. 
T+I+1 {Increments J for the looping. 


+ST eae us back to the be- 


ginning to check on J. 


We now have only a few minor details to settle: the original decision of 
whether to go to the n = 2 case and some ‘‘housekeeping chores.’’ For 
example, in order to define M and E in the way in which we have done it, 
they have to be ‘‘defined as lists,’ even if there are no elements in them. 
This is done by setting M <— Wand E< 10. 

The whole routine is 


VD<DET A;N3#3;13;IN;IND;M;B 
[1] «10 / pA 
[2]  7#NDx1(N=2) 
[3] IND<iN 
C4] B+(N,N)po 


[5] JI<1 
[6] M10 
[7] #<10 


[8] ST:+ST1x.1(I > N) 

[9] IN«(I#IND)/IND 

[10] MeM,DET 1 1 ¥BL3;INDI<AL;I,IN] 
[11] E«#,(~1)*(I+1) 
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[12] I+I+1 

[13] +ST 

[14] ST1:D«A[1;]+.x(Fxm) 

[15] +0 

[16] END:D<(x/ 1 1 &4)-A[1;2]xAL2;1] 
Vv 

Let’s try it. 


A*2 291234 


DET A 
“2 
4433012323434 5 
DET Z 
0 
D5 5937125643020 30 
121065321020 
DET D 
278 


With this last one, if you waited around for what seemed like an age 
(actually, about 3 minutes if you are on the IBM 5110—though only a few 
seconds on a large computer), you are right in concluding that the calcula- 
tions seem to be very slow. This example raises a very important issue: the 
straightforward programming of a mathematical statement need not be the 
most computationally efficient way to calculate something. The main prob- 
lem with the above routine is that the calculation of all the subdeterminants 
involves a very large number of operations—a number that increases with 
the dimension of A very rapidly indeed. 

A computationally more efficient way of obtaining the determinant is to 
convert A to a triangular matrix (e.g., all entries below the diagonal are 
zero), and then the determinant is simply the product of the diagonals. 
Here is a more efficient routine.* 


V D«PDET A 
[1] M+(pA)[1] <An alternative to getting dimen- 
sion of A. 
[2] I+«1 <— Initializes /. 
[3] B+AL;I] < Stores /th column of A in B. 
ru] Bliri<o < Zeros out the first J elements of B. 


* This routine was written by our colleague Dr. Richard W. Parks, University of Washington, Seattle. 
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[5] A<A-(Bo.xALT;])+AlT3T] <See below. 
{6] + (N2I<I+1)/3 <See below. 


[7] Dex/ 11 4 <-Obtains determinant by multiply- 
ing the diagonal elements. 


V 


Only lines [5] and [6] require any explanation. The main idea of this 
routine’s algorithm (method of calculating a mathematical formula) oc- 
curs in line [5]. The idea is to transform the matrix A into a triangular 
matrix; in this case all elements below the diagonal are zero. This is ac- 
complished by subtracting from each row of A the elements of A to the left 
of the diagonal. 

Line [6] increments the index / and then returns to line [3] if N = J; 
otherwise we continue to line [7] to calculate D. 


PDET 2 


PDET D 
278 


If you try this routine, you will find that it will execute the matrix D much 
faster than does the previous routine. Indeed, on the IBM 5120 it will do 
the D matrix in about 7 seconds. However, even this routine has its prob- 
lems, since it will not work if any of the diagonal elements are zero. 

While this lesson may have been a bit expensive, the lesson to be learned 
is important. As your routines become more complex, you will have to 
worry about computationally efficient ways of handling the problem. Fi- 
nally, you should realize that the usual mathematical statements, while 
very informative, can often be computationally inefficient. 


Inner Product and Multidimensional Arrays: the general form of 
this product is Afn, - fn.B, where A, Bare arrays, and fn,, fn, are functions. 
For example, A+.xB where A, B are two arrays, gives the usual mathe- 
matical operation of ‘‘matrix product.’? The inner product for multiple 
dimensional arrays is always between the elements of the /ast dimension of 
Aand the elements of the first dimension of B. 


Outer Product, o.fn (keyed by upper shift 7, period, followed by a 
function): general use is given by Ao. fn B where A, B are arrays. The 
output is an array whose dimensions are the cafenation of the dimensions of 
A and of B. Each element of A is an argument to the dyadic function fn for 
every element of B. 

Kronecker Matrix Product: mathematically is given by >@ W, 
where = and W are (nm X n)and (m X m) matrices, respectively. The result 


236 


Exercises 


Inner and Outer Products—Matrix Manipulation 


is of dimension (mm X nm) and each element of > scalar multiplies the 
matrix W. The Kronecker product, K, for two matrices A, B is obtained in 
APL by 


MN<(NW<1 0/ pA)x(Me1 0/ pB) 
K«MN MN p1 3 2 4 QAo.xB 


Determinant of a square matrix A, denoted by |A|: defined mathematically 
by 


|A| = S%,a,A,, fot anyi=1,2,...,n. 
aj = (i, j)th element of A 
Ay = (-1I)D; 


Dj; = |Aj|, called the (i) minor of A. 
Ay = matrix A with ith row and jth column deleted, is the (i )th 
cofactor of A. 


Two illustrative APL routines are given in the text; care is needed in the 
use of both. 


APL Practice 


1. For any list Z, the APL expressions JZH((pL), 1)o1 and LHL+L give 
the same number. What is this number? Why is this result true? 


2. For two positive numbers A and B, the APL expressions 
((4*2)+B*2)*.5 and Ax40B+A give the same number. What is this 
number? What is the explanation? 


3. IfX+0(0,112)+12 and Y<«Q 1 2 3 °.0X, what is in Y? 

4. LetM be any matrix and Z<-(,M>K)/,M where K is any number. What 
isin L? 

5. Let Q¢?N 9 K and F++/[1]Q°.=1X for any two positive numbers N 
and K. What is in F? 

6. Use the outer product to construct the following tables. 
(a) A table 100 by 10 of the powers from 1 to 10 of the integers from 1 

to 100. 


(b) A table 100 by 2 of the logarithms to base e and base 10 of the 
integers from 1 to 100. 


(c) A table 100 by 10 of the Ist through the 10th root of the integers 
from 1 to 100. 


(d) A table of all possible products of the integers from 1 to 30. 


(e) A table of the binomial coefficients (=|) fori <j andj = 30. (See 
exercise 9, Chapter 11.) 
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(f) The following matrix: 


1 p p’ ae T-1 
p 1 p p? eee p’? 
sal: be fee ok ; 
2. 4p 
: fae, ae 1 p 
| i a a p 1 


for T = 30, where p can take the values 0.2, 0.5, 0.7. 


7. Here is how the outer product can be used to plot functions. Consider 
the following steps: 
(a) Y< oX<"11+121 
(b) L<Yo.xX 
(c) Let WeYo.=X+1 
(d) K+','+'[4140=W] 
one blank space 


(e) PLOT<' V+V'[W+1+2x0=L] 
one blank space 


Use this procedure to plot the following functions: 
(a) Y=2X+4 
(b) Y=X°-1 
(c) Y=-3X° 42X43 
8. Since you know that the rank of an idempotent matrix is equal to its 
trace, find the rank of M = I — X(X'X)"'X’', where 


14 1 
X= {2 1 6 


a2 2 


9, The following matrix is called a payoff matrix of firm A which has 
only one competitor, firm B. Firm A has 3 possible pricing policies and 
firm B has 4 possible pricing policies. The elements of the matrix 
represent profit in $1000 of firm A given B’s possible pricing policies. 


B’s Strategy 
[ie 2. <B8e 4 


50 90 18 25 


A’s Strategy 2/27 5 9 95 
3 | 64 30 12 20 


There is another matrix for B which is not given because its elements 
are known to be the differences between the entries inA’s matrix and 
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100 (assuming that only $100,000 profit can be made in the market). If 
A chooses strategy 1, then B will choose strategy 3, because B gets 
$82,000, which is the highest profit for B given A’s strategy. Thus A’s 
one optimum policy (called maximin) is to find the minimum of each 
row and then pick the maximum of those minima. This long verbal 
story is only half a line in APL. Can you write it? What if A chooses 
a minimax strategy? (A minimax strategy is exactly the opposite to 
the maximin strategy.) 


10. Consider the following table: 
Table M 
Possible home mortgage rate 
values this year in % 

10.50 11.00 11.50 11.75 
Possible home 10.50 0.1 0.25 0.4 0.25 
mortgage rate 11.00 0.25 0.3 0.25 0.20 
values next 11.50 0.4 0.25 0.1 0.25 
year in % 11.75 0.25 0.20 0.25 0.3 


11. 


Each element of this table represents the probability of next year’s 
interest rate given the rate this year. Notice that column and row 
elements sum to |. Enter the matrix of probabilities into the variable 
M. 


(a) Find next year’s expected interest rate if this year’s interest rate 
is 11.50%. 


(b) Find next year’s expected interest rates under all possible alterna- 
tive values for this year. 


(c) Let 


10.5 
_ | 11.0 
111.5 
11.75 
Then Mr, will be a vector of next year’s expected values given this 
year’s alternatives. Let’s call this vector r,,,. Then rj.2 = Mr,4, will 
be the potential values in year r,,,.Give alternative interest rates for 
year ¢ + 1. Project possible interest rates up to 5 periods ahead. Do 
they seem to converge to an equilibrium value? 


Enter both routines DET and PDET in your workspace (see pages 
233-235) and find the determinants of the matrices 


1 2 4 1 3 2 
(a) A=1]2 4 1 (b) B= ]1 4 2 
3 1 3 i 1 2 


(c) AB 
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12. 


13. 


14, 


(d) A @B, where ® is the Kronecker product. 
(ec) B@A 
Using the PDET A function (page 234), solve the following system of 
equations utilizing Crammer’s rule, which is explained below: 

2X + 3Y-3Z=7 

X-2Y+ Z=-2 

3X + Yt Z=9 

Solutions by Crammer’s rule are given by: 


ae _ oy _ oz 
x= 7> Y=7> ~ A 


where 

Ais the determinant of the matrix of coefficients. 

Ay is the determinant of the matrix A with the first column replaced 
by the column of constants. 

Ay is the determinant of matrix A with the second column replaced by 
the column of constants. 

Az is the determinant of the matrix A with the third column replaced 
by the column of constants. 


Let C+"1+?10 02. Examine carefully the results of the following op- 
erations. 

(a) 2AC 

(b) 1vC 

(c) Ove 

(d) C+.AC 

(e) Co.ac 

(f) Co.¥C 

(g) Co.nxC and compare to (e). 

(h) Co.vC and compare to (f). 


(a) Assign the statement ‘‘J don’t like APL and tea’’ to variable X. 


(b) Write an APL function that will erase the n, the apostrophe and 
the t of the word don’t using the logical operators. 
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For those of you who already know a fair amount of statistics, here is the 
chapter you have been waiting for. Now is the time to come to grips with 
regression analysis, the calculation of confidence intervals, and tests of 
hypotheses. In this chapter you will learn the advantages of being able to 
do all your statistical analysis yourself instead of relying on someone else’s 
black box. Before you proceed, you might want to refresh your memory 
about simple linear regression (Chapter 9) and the use of H (Chapter 10). 

This chapter deals only with the analysis of single-equation (as opposed 
to multi-equation), and linear (as opposed to nonlinear), regression equa- 
tions. You might well be wondering why we have a whole chapter on 
regression, when the regression of a vector Y on a matrix X is obtained in 
APL by Y # X. The answer is that there are a number of associated 
statistics with a linear regression which need calculation and these are a 
little messier computationally. Let’s begin. 


13.1 Covariance and Correlation Matrices 


Moment, 
Covariance and 
Correlation Matrices 


240 


The basic input to any regression problem is an (N X K) matrix of N 
observations on K regressors (variables used to “‘explain’’ the dependent 
or regressand variable Y). An important set of statistics for many reasons is 
the sample moment, covariance, and correlation matrices. If X is the 


Simple Correlation 
Matrix of Regressors 
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(N X K) regressor matrix, the mathematical statements are, in matrix 
terms, 


Moment matrix X'X 
Covariance matrix X'X — X'X _ 
Correlation matrix D™?(Xx’xX —- X'X)D™'? 


where you may recall that X'X is the inner product between X transpose 
and X itself, ¥ is an N X K matrix, each of whose rows contains the array 
of means of the K regressors, and D is a diagonal matrix whose ith diagonal 
element is the variance of the ith regressor variable, so that D~'? is the 
diagonal matrix of the inverse of the square roots of the nonzero elements 
of D. 

The above mathematical equations give the matrix equivalents of the 
variances and covariances we calculated in Chapters 5 and 9. Each of 
these matrices is easily obtained in APL. They are, in turn, 


Moment matrix: MM<(QX)+.xX 

Covariance matrix: CM<MM-(XBo.xXB++7/X)=(pX)[1] 

Correlation matrix: CRM+D+.xCM+.xD48(((1K)°.=1K<(pX)027) 
xCM)*0.5 


Let’s explain this. The moment matrix is easy: &X produces X trans- 
pose and (QX) +.x X gives the inner product of 8X with X. The 
covariance matrix introduces something we used briefly in Chapter 9, 
namely (oX)[1] , which means: shape of X over the first dimension, in 
short, the number of rows. The rest is easy to understand once you realize 
that X'X, where X is an (N X K) matrix of the means of the K variables, is 
simply N (= number of rows = (p*)[1]) times the outer product of the 
array of means, or 1/N times the outer product of the vector of column 
sums of X. 

The correlation matrix is obtained from the covariance matrix by divid- 
ing the covariance between the ith and jth variables by the product of the 
standard deviations (square roots of the variances) of the ith and jth vari- 
ables. The variance of the ith variable is clearly given by the ith diagonal 
term of CM. The elements of CRM, usually labelled in mathematical nota- 
tion as p = Cov(x;, x;)/V Var(x) Var(x,), satisfy the mathematical con- 
straint —IlSp,<1. For further mathematical and statistical details see 
Mendenhall and Reinmuth or Kmenta, both of which are listed in the 
bibliography at the end of the book. 

The APL expression is now fairly obvious. K is the number of regres- 
sors. ((1X)°.=1X)xCM produces a diagonal matrix with the diagonal ele- 
ments of CM, and [(((1X)°.=1X)xCM)*.5 gives the inverse of the 
square roots of the variance obtained from the CM matrix; the reason that 
we cannot use * 0.5 is that the zero off-diagonal terms will give 
DOMAIN ERROR. The operation D+.xCM+.xD produces a matrix with pj on 
the off-diagonal positions and 1’s on the diagonal. 
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For example, let us suppose that we have the following X matrix in the 
computer’s memory: 


100 5.50 

90 6.30 

80 7.20 

70 7.00 

70 6.30 

xX _ 70 7.35 
(12x 2)” 70 5.60 
65 7.15 

60 7.50 

60 6.90 

55 7.15 

50 6.50 


and if we don’t, let’s enter X now. 
Consider a simple routine to calculate the moment matrices that we have 
been discussing. 


VY MATLUI] V 


VCRM*MAT X;D;MM;CM;K;XB 
[1] 'MM EQUALS! ;MMe(QX)+.x*X 
[2] 'CM EQUALS ;CM<MM-(XBe.xXB<+#4X)+(oX)[1] 
C3] kK+(pX)E2] 
C4] 'CRM EQUALS' 
[5] CRM«D+.xCM+.xDi(((1K)°.=1K)xCM)*0.5 


MAT X 
MM EQUALS 

61050 5577.5 

5577 544.2075 

CM EQUALS 

2250 “54 

“54 4,857291667 
CRM EQUALS 

1 “0, 5165417262 


~0.5165417262 1 
And if we have the following three-column regressor matrix: 


LEZ 
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55 100 B75 
70 30 6.3 
30 80 732 
100 70 7 
30 70 653 
105 70 735 
80 70 546 
110 65 7545 
125 60 TS 
115 60 6.9 
130 55 TAS 
130 50 6.5 
Our routine produces: 
MAT 22 
MM EQUALS 
126300 80450 8170.25 
80450 61050 5577. 
8170.25 5577.5 544.2075 
CM EQUALS 
6300 ~3550 195.25 
“3550 250 pee 
125225 “54 
CRM EQUALS 
1 ~0.942902569586 


O.942902569586 1 


4,85729166667 


70 .516541726216 


0.715995624984 ~0.516541726216 
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0.715995624984 


A. 
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The first generalization of the regression model which was discussed in 
Chapter 9 is to allow for more than one regressor; to some extent we have 
done this already with the use of the function &. The general model to be 


discussed in this chapter is 
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Y=XB+U 


where Y is an (N X 1) regressand vector, X is the N x K matrix of regres- 
sors, B is the K X 1 vector of regression coefficients to be estimated, and U 
is the N X | vector of unobserved disturbance terms. Excellent elementary 
discussions of multiple regression analysis can be found in both of the 
references mentioned on page 241. 

From our work in Chapters 9 and 10, you know the least-squares ap- 
proach to estimating the regression coefficients involves the calculation of 
the left inverse of X. The left inverse of X is (X' X)7'X’ (recall that A,, is the 
left inverse of A if A,,A = J) and in APL this is obtained by x, using the 
monadic use of domino. The least-squares regression coefficients are ob- 
tained by the dyadic use of domino, YX, which is the APL equivalent of 
the mathematical expression (X’X)"'X’ Y. Mathematically, we obtain from 
(X'X) |X’ Y the expression B + (X’X)"!X’U. The statistical properties of 
the least-squares estimator depend on the statistical properties of the vec- 
tor (X’X)~'X’ U. Let’s store the regression coefficient estimators in BE, i.e., 
we compute BE<YEHX; BE stands for B vector Estimate. 

AS we mentioned in the note to instructors, some computer systems 
have not implemented the dyadic form of domino. An easy way around this 
is to write your own function directly. 


VY DQ X 
C1) (HCC QX) +.%X) +. x C(QX)+4.*%Y)V 


Here is how it works for a 3 independent variable case where the intercept 
is constrained to be equal to zero. 


CLEAR WS 
Y¥«10?710 
X<10 3p30 ?30 
YE 
0.076969 0.0021871 0.24941 
(E] (CC QX)+.xX) D+. x (QX)+.xY) 


0.076969 0.0021871 0.24941 


The estimator of the variance of the disturbance term was given for the 
special case examined in Chapter 9. The general mathematical expression 
is 


Var(U) = 3,(Y, — ¥)2/(N — K) 


where Y = XB, Y’ = (¥,, Yo,. . . , ¥,), where A symbolizes the estimator 
of the variable or parameter under the 4. Thus B symbolizes the estimator 
of B in the model Y = XB + U, where B = (X'X)"'X'Y. 

The maximum-likelihood estimator of the variance of U is a little differ- 
ent, involving division by N instead of N — K. Y; is the forecast value of Y; 


a 


for the ith observation given the regression coefficient estimates, i.e., Y; = 
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=) XB; B, j = 1,2, .. ., K are obtained in APL by calculating the 
vector BE defined above. The APL expression for the estimator of the 
variance of U is 


NP# ox 
V «(SSE<( Y-X+.xBE)*2)+(-/NP) 


and in the process, we have defined the ‘‘error sum of squares,’’ SSE, 
which will be needed later. Note that NP is the array (N, K), and -/NP is 
simply (N — K). Y is obviously given by X +.x BE. 

The estimated covariance matrix of the regression coefficient estimators 
has two forms, depending on whether or not the regression equation is run 
in ‘‘deviation terms’’ [that is, after elimination of the ‘‘constant vector’’ 
’ =(1, 1,..., 1) by subtracting the mean from each variable]. If the 
regression is run in deviation terms, the estimated covariance matrix for 
BE is given by 


COVD « V x CM 


If the regression is not run in deviation terms, the estimated covariance 
matrix for BE is given by 


IT«i1 
RM <« (QZ) +.x Z2« T,X 
COV <Vx RM 


The dimensions of COVD are (K x K) and of COV, ((K + 1) x (K + 1)), 
when there are K regressors in X. The use of the catenate function in 
creating the matrix Z from J and X was discussed in Chapter 11. 

The coefficient of determination (R?) was defined in Chapter 9; the mul- 
tiple correlation coefficient is merely the square root of R?. Note that the 
coefficient of determination defined in Chapter 9 by 1-SSE/SST gives the 
coefficient of determination with respect to variables defined in terms of 
deviations about their respective means. If the R° value is wanted in terms 
of the original variables, then the new R? is defined as before, except that 
SST is given by Y+.x Y instead of Y+.x Y-((+/Y)*2)=N . 
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Simple and Partial 
Correlation 
Coefficients 


A topic related to regression analysis is the analysis of simple and partial 
correlation coefficients. The simple correlation coefficient has already been 
defined and calculated in this chapter. Given K variables, the simple corre- 
lation between any two of them, say the ith and/th, is given mathematically 
by rj = Cov(x;, x;)/ V Var(x;)Var(x;); the simple correlation coefficient 
‘‘measures”’ the extent of linear association between the two variables i 
and j. 

A related concept is the partial correlation coefficient, say rj;.,, which is 
the correlation between variables i andj after allowing for the joint correla- 
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tions between variables k and i and between variables k and j. The partial 
correlation coefficient can be obtained easily from the simple correlation 
coefficients; mathematically, one has 


Kick = (His — Tietix)/ VC re) C= rix) 


where fj, ix, Fx are the simple correlation coefficients. 


The matrix CRM obtained above gives for any set of K variables the 


matrix of simple correlation coefficients between each of the K variables. 
Any partial correlation coefficient can be obtained from the CRM matrix. 
By now you should be able to program this on your own. 


13.4 Creation of a Regression Routine 


Let us now consider creating a reasonably complete regression routine. 
The procedure will not only provide us with our own regression routine 
which, of course, we can alter in any way at any time we like, but it also 
will provide some valuable lessons in how to write routines that are more 
complicated than those we have tackled so far. Let’s begin at the very 
beginning, which is to determine what we are required to do mathemat- 
ically. Our next step will be to plan our programming steps. Here then is a 
recommended procedure for you to follow in writing any program routine 
of any difficulty. 


1. 


Write down all the mathematical expressions needed; sequence the 
expressions so that all operations to be completed before a particular 
step are in fact listed before that step. One diagrammatic way to help 
you do this is to write down a flow chart, an example of which is 
given for the regression routine below. 


Figure out whether any loops or conditional branch statements are 
needed and, if so, where in the program they are needed. Rethink the 
basic APL approach to see if there are alternative ways to get results 
which will not use up inordinate amounts of computer time and 
space. 


Decide for each function whether the result is to be explicit or im- 
plicit, have one or two arguments, and which variables, if any, are to 
be globally defined. 


Begin with paper and pencil and assemble the various parts of the 
routine that may already be written out. 


Using your flow chart, lay out the sequence of ‘‘one liners’’ and set 
up your decision branches. As you do so, label all statements involv- 
ing branches. 


With each ‘‘one liner’’ expression, start with the ‘‘core’’ of the math- 
ematical term and put it in the middle of the page; build outwards from 
there, and then recheck by reading the expression from right to left. 
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10. 


11. 


Alter the header line as you introduce local variables to avoid forget- 
ting them. 


Recheck your program carefully, looking to see that: 

(a) program statements give the results desired (read from right to 
left!); 

(b) the relationships between arrays, vectors, and matrices are com- 
formable (possible length errors); 

(c) every variable will be defined before it is used; 

(d) loops, branches, and recursive function definitions will terminate 
under all alternatives. 


Document your routine very carefully and include explanatory com- 
ments inside it (within a month, you won’t remember how you did it!). 


Enter the routine into the computer, then enter several easy problems 
for which you have the answers in order to check that the routine is 
correct. Be sure that you check the routine for general, not just 
special, cases. 


Remember, even when you believe it is impossible to make a mistake, 
you will! 


You might well ask: Why all these elaborate instructions? Up until now 
most of our routines have simply been written out. The answer is that so 
far the routines have all been very uncomplicated—the regression routine 
to follow is our first example of a more elaborate routine. Also, it is the first 
routine that we might want to keep for future use, so we need to be more 
careful about how we define it and set it up for use by others. 

The following routine is a suggestion; a lot of variables are global since it 
is assumed that the output will be used in further analysis. We need some 
trial data. Let us suppose that the first column of ZZZ is the regressand 
array Y, and the remaining two columns of ZZZ are the regressor matrix X. 


Thus 


¥+2Z22(031] 
X+Z2Z2Z032 3] 
Y 


55 70 90 100 90 105 80 110 125 115 130 130 


[@>) ~ ~ om 
e « . 
NO w 
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70 7.35 
70 5.6 
65 7.15 
60 7.5 
60 6.9 
55 7.15 
50 6.5 


Let us begin to write out the regression routine by deciding on what it is 
we want. 


Desired Calculations 


regression coefficient estimates 

raw moment matrix 

covariance matrix for regression coefficients 

variance of disturbance term U, 

R? (the coefficient of determination) 

correlation matrix for regressors 

array Y 

array U = (Y — f) 

F statistics with degrees of freedom 

standard deviations of the regression coefficient estimates 


T ratios for the regression coefficient estimates 


Disposition: all of the above to be global variables; but Y, U to be printed 
out automatically; two arguments Y and X, 


An examination of the formulas in Table 13.1 shows that there is a cer- 
tain order in which calculations should be made and certain intermediate 
products which need to be defined as they are reused repeatedly. While 
the mathematical structure of this routine is very simple and hardly calls 
for an elaborate flow chart, one is given in Figure 13.1 in order to illustrate 
the idea. Our main concern is to keep the number of computations and 
numbers of arrays to a minimum, the former to economize on computer 
time, the latter to economize on space taken up in the computer. 

In the process of working up the flow chart, you can see that some ideas 
occurred to us. First, we should remember that a function or a routine is 
defined for general use, so you should be on your guard to watch out for 
problems caused by someone trying to do something which you did not 
anticipate. The first decision box illustrates this problem. Someone may 
inadvertently specify an X matrix with more variables than observations 
(more columns than rows). If no test is made for this mistake, the com- 
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Table 13.1 List of Mathematical Expressions and Tentative APL Statements 


Definition 


Mathematical Expression 


APL Expression 


Moment matrix of 
x 

Covariance matrix 
of X 

Correlation matrix 
of X 

(D is the diagonal 
matrix obtained 
from Cov(X)) 
Regression coeffi- 
cient estimators: 


Total Sum of 
Squares (TSS) 
Error Sum of 
Squares (ESS) 
Var (U) 

R? 

Regression SS 
(RSS) 

If done 

without constant 
term: 

F = (RSS/ESS)/ 
((N — K — 1)/K) 
COV matrix for 
B, where 

Z = (i, x) 
Standard devia- 
tions of elements 
of B given by 
square roots of 
diagonal elements 
of covariance 
matrix for B, S3 
Student ‘‘t’’ ratios 


XX 


XX-XX 


D™'2(X'X _ Xx X)D"? 


Diag(X’X — X'X) 


YY - YY = SU? 


SUAV(N — K — 1) 
1 — ESS/TSS 
sy - ¥? 


var (U)(Z'Z)"! 


S3 = (Diag Cov(B)) 


B,/Ss,i = 1,2,... 


,k 


Mé-(QX)4+ xX 
CMCMM- (XBo .xXB++74X)+(pX) 
CRMCD+ .xCM+ .xD 


<(((1K)°.=1K+(pX)02]) 
xCM)*0.5 


BE<YEX 

YA<X+ .xBE 
UH<Y-YH 
TSS<+ /(Y«YB)*2 
YB«(+/Y)tp¥ 
BSS++ /UH* 2 


VARU*ESS+ (N-K+1) 
RSQ*1-ESS?TSS 


RSS«+ /( YH-YB)*2 


F«( RSS+ESS)x((W-K+1)+K) 
COVB+VARUXE]( (QZ) +.xZ) 


SBE<(1 1QCOVB)*0.5 


TRATIO*BE+SBE 


puter will blithely proceed to the calculation of the regression coefficients 
and then give a DOMAIN ERROR and specify the line in the program at 
which the error occurred. This is all very well, but you may want the 
program which contains the regression routine to continue in any case and 
not stop. If so, a diagnostic test will get you around the difficulty before 
you waste valuable computer time. In addition, you can specify a warning 
variable to indicate when the regression routine has not, in fact, been 
executed; this is called ‘‘setting a flag.”’ 

The second idea is that sometimes we want to carry out the F test on the 
mean as well as on the statistical significance of the regression coefficients 
for the regressor matrix, so we have allowed for two variants of the test: 


RRS, = >(¥- ¥Y)? and RSS, = >(¥? 
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x? 


x,y 
x'x — x’x° 
correl Mat. X, D® 


BE 
YH 
UH 


TSS° 
ESS° 
RSS° 
RSS)’ 


RSQ° 


VARU 
CO V BE 


FY, F3 
St. Dev. BE® 


Student f ratios 


St. Dev. BE, RSQ 
CO V BE 

VARU 

Fi, Fe 


Figure 13.1 
Flow chart for regres- 
sion routine. 
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[1] 


[2] 


3] 
C4] 
[5] 
[6] 
[7] 
[4.2] 
[8] 
[9] 
[4.5] 
[10] 
[3.1] 


[3.2] 


C11] 
[12] 
[13] 
[14] 
[15] 
[16] 
[17] 
[18] 
[19] 
[20] 
[21] 


Sample Main Sheet 
V Y MREGRESS X;MS;D;SS 
a THIS PROGRAM ASSUMES Y TO BE AN ARRAY OF DIMEN. 
(N), X TO BE AN (NxK) ARRAY 
a OF RANK K. A DIAGNOSTIC IS PRINTED IF NskK OR IF 
RANK (X) <K. 
a THE CONSTANT TERM IS ADDED BY THE ROUTINE. 
NP*« 0X 
PREMEND1x1(NPC11<NPL21]+1) 
CM<(MM<(Q X)+.*X)-(MS° .xMS<+74X) +NPC1] 
CRMD+ .xCM+.xDl( (( 1K) °. = 1K) XCM) *0.5 
K+*NPL2] 
MS<((+/Y),MS)+NP[1] 
SSL21<(SS01]<+/¥*2)-NPL11xMS[1]*2 
SS<+5 00 
MAIN: UH<( Y-( YH<X+ .x( BE<YE|(X*«(1,X))))) 
akxxke IF DOMAIN ERROR OCCURS ON LINE [7] OR [MAIN] 
X IS SINGULAR, ROUTINE WILL BE SUSPENDED 
AOPERATOR SHOULD KEY IN: +PREMEND 
IN ORDER TO COMPLETE ROUTINE, ***x 
SSC41+(SSL3 ]++/YH*2)-NPL11xMS[1]*2 
SSU5 ]<+/UH*2 
RSQ<1-SS[5]+SS[2] 
VARU<SSL5 ]+(-/NP)-1 
F1<SSl4]sSS5]xk+(-/NP)-1 
F2«SS[3]+SS(05 ]x(K+1)+(-/NP)-1 
COVBE*VARUXE( QX ) + xX 
STDBE<(1 1 QCOVBE)*0.5 
'THE RAW MOMENT MATRIX OF REGRESSORS.IS:' 
MM 


'THE COVARIANCE MATRIX OF REGRESSORS IS:' 
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[22] 
[23] 
[24] 
[25] 
[18.5] 
[26] 
£20.13) 
[20y3] 


£ a2 ad 
[3.4] 

gates reals 
[18.2] 
[18.3] 
[30] 


Roa] 
Ee? 
[33] 
[34] 
Fos 
[36] 
E37] 
[38] 
[39] 
C40] 
C41] 


C42) 


[43] 


C4u ] 


CM 


MAINEND: ‘THE CORRELATION MATRIX OF REGRESSORS IS:' 


CRM 


‘THE VECTOR OF MEANS; Y,X1,X1, . . .IS:!' 


TRATIO*«BE+STDBE 
MS 
+MAINENDx1FLAG 


"ROUTINE PREMATURELY ENDED DUE TO SINGULARITY OF 


X MATRIX 

+0 

FLAG+1 

+CONT 
PREMEND : FLAG*0 


[19] CONT: ‘THE RAW MOMENT MATRIX OF REGRESSORS IS:' 
'THE REGRESSION COEFFICIENT ESTIMATES ARE IN ORDER 


CONST., H1,X2 . . . 2! 
BE 


'THE CORRESPONDING STANDARD ERRORS ARE:' 


STDBE 


'THE REG. COEF. EST. COVARIANCE MATRIX IS:' 


COVBE 


"RSQ IS: ';RSQ;' VAR. ERROR TERM IS: ' ;VARU 


'THE F STATISTIC INCLUDING THE CONSTANT TERM IS:' 


F2;' WITH ';(K+1),( 1+-/NP); ‘DEGREES OF FREEDOM. ' 


'THE F STATISTIC NOT INCLUDING CONSTANT TERM IS:' 


Fi; ‘WITH ';K,(-1+-/NP); ' DEGREES OF FREEDOM: ' 


! THIS ENDS THE OUTPUT FROM MREGRESS. * 


+0 

PREMEND 1:'NO. OF OBS. 
OF REGRESSORS (K).' 
"ROUTINE TERMINATED! 


(NM) IS TOO FEW RELATIVE TO NO. 
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[33.5] 'THE CORRESPONDING T RATTOS ARE:' 
[33.7] TRATIO 


[45] Vv 


Sample Subsidiary Sheet 
Used for Preliminary Working Out of APL Statements 
Reline [4]: N«pX 
+PREMENDx 1 (NE 1I<NE21+1) 


Reline [6]: MM<(QX)+.xX 

CM<MM- (MS .xMS<++7X)+NL1] 
Reline [7]: CRM<D+.xCM+.xD8(( (1K) °.=1K«N[2])xCM)*0.5 
Reline [8]: MS<((+/Y)+N[1],MS+Nf1]) 


Reline [9]; 7SS2<( 7S5<+/Y*2)-NxMS[1]*2 


Reline [10]: YH<X+.x(BE<YE(X+(1,X))) 


At this point you could get aDOMAIN ERROR and a suspended routine 
if the augmented X matrix is singular. To get out of this and continue 
operation will require operator intervention from the terminal; a flag must 
be set to bypass the remaining analysis and to print only what has been 
done so far. Operator action is indicated in the comment lines labelled [3.1] 
and [3.2]. You will notice that we have just realized that we could also get a 
domain error leading to a suspension in the current line [7]. (This is indi- 
cated on the flow chart by the dotted-dashed line which was added later.) 


Reline [11]: SSC4J<(SS031<+/YH*2)-NPL11xMS[1]*2 
Reline [17]: COVBE«VARUxE( QX)+.xX 


Reline [18]: STDBE«(1 1 QCOVBE)*0.5 


Some of the early calculations not affected by the domain error can be 
printed, but we will need to bypass the remaining items if the routine has a 
domain error. The inserts at lines [3.4] and at [18.1;2;3] do this. 

With the flow chart and our previously developed ‘‘one-liners’’ in hand, 
let us proceed to write out our regression routine. As you do it, start off with 
a large clean piece of paper, work in pencil, and leave yourself lots of room 
for making corrections. Do the more complicated one-liners first on a 
separate piece of paper, then fit them into the main routine. When you 
have finished, rewrite the routine carefully, checking it as you go. Review it 
in light of our comments above, then (and only then), consider keying into 
the computer. Here is a typical sample of trying to write a routine—warts 
and all! Before you read the following paragraphs, glance over the sample 
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program listed on pages 251-253, the detailed comments on pages 246-247, 
and the flow chart in Figure 13.1. 

Some comments are in order. While all the APL operations used in this 
program are familiar to you by now, there are a few items which should be 
emphasized. In line [9] we decided we would put the various sums of 
squares into an array. But executing SS[11]<+/Y*2 would give a value 
error if SS had not already been declared a list! That is why we had to 
insert line [4.5], i.e., this line is to go between lines [4.2] and [5]. Line [4.5], 
SS*5 p 0, sets up SS as an array of zeros. The statements in line [9] put 
sums of squares into various elements of the list SS. 

The inserted lines [4.5], [3.1], and [3.2] illustrate two things. First, we 
promised that we would show how writing a routine is done, including 
mistakes! Quite frankly, we forgot all about the need to define SS as an 
array before reaching line [9] until we, in fact, wrote line [9]. Likewise, the 
inserted statements [3.1] and [3.2] were afterthoughts—in short, we 
goofed. The second item to note is that the routine can be keyed-in just as it 
stands, and on being displayed everything will be rearranged into proper 
sequence by the computer. 

Finally, as you look between the flow chart, the preliminary statement, 
and the sample main sheet version, you will see that we changed our mind 
at times; we hope each change was for the better. If nothing else, this 
shows that the flow chart is a guide to aid you, not an inflexible route from 
which you dare not deviate. 

The routine is not quite finished, since we need to add a series of com- 
ment statements to guide any reader (including ourselves in a month’s 
time). For example, 


[4.8] a CHECK NO. OBS. (WN) > NO. VARS (K) 
[5.1] a THIS SECTION CALCULATES RAW MOMENT MATRIX, MOM. MAT.. 
AND CORREL. MAT. FOR REG. MAT. X. 


and so on. 
A Regression The last steps are to reread the routine carefully to check for errors of 
Routine both omission and commission, key it into the computer, check that the 


keying-in is correct, and then run some tests on its operation under all the 
situations it is meant to encounter. For example, (i) run a regression for 
which the results are already known; (ii) try to enter a regression with 
fewer observations than variables; and (iii) try one with a singular X ma- 
trix. 

Let’s try the routine. Begin by displaying the rearranged routine and the 
X and Y arrays to be used for the test. 


VMREGRESS([OIV 
Vv Y MREGRESS X;MS;D;SS 


[1] aTHIS PROGRAM ASSUMES Y TO BE AN ARRAY OF DIMEN. (WN), 
[2] aX TO BE AN (NxK) ARRAY OF RANK K. A DIAGNOSTIC IS 
[3] APRINTED IF NSK OR IF HANK X < K 


13.4 Creation of a Regression Routine 


[4] 
[5] 


aTHE CONSTANT TERM IS ADDED BY THE ROUTINE. 
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axx*x*xx TF DOMAIN ERROR OCCURS ON LINE 14 OR [MAIN] X IS 


SINGULAR, 


AROUTINE WILL BE SUSPENDED. OPERATOR SHOULD KEY IN: 


AIN ORDER TO COMPLETE ROUTINE. 

FIAG+1 

NP<pX 

K<NPL2] 

SS+5 pO 

+PREMEND1x1(NPL1J<NPL2]+1) 

CM (MM<(QX)+.xX)~(MS° .xMS<« +74X)2NP[ 1] 
CRM<D+ .xCM+.xD«E)((( 1K) °.=1K)xCM)*0.5 


MS«((+/Y) ,MS)+NPL1] 
SST 2)<(SSC1]<+/Y*2)-NPL1)xMSL1]*2 
(X<(1,%))))) 


MATN: UH«(Y-(YH«X+ .x(BE« 


SSL4]<(SSL3]<+/YH*2)-NPL11xMS[1J*2 
TRATIO+BE+STDBE 
SSC5 ]<+/UH*2 
RSQ<1-SS 512882] 
VARU*SSL5]+(-/NP)-1 
F1+SSC4J+SS05]xKk+(-/NP)-1 
F2«SS[3]+SSU5]x(K+1)+(-/NP)-1 
COVBE*VARUxE( QX) +. xX 


STDBE*(1 1 QCOVBE)*0.5 

+CONT 

PREMEND: FLAG*O 

CONT: 'THE RAW MOMENT MATRIX OF REGRESSORS IS:' 
MM 

"THE COVARIANCE MATRIX OF REGRESSORS IS:' 

CM 

>+MATNENDX FLAG 


>PREMEND 


"ROUTINE PREMATURELY ENDED DUE TO SINGULARITY OF X MATRIX! 
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[35] 
[36] 
[37] 
[38] 


[39] 


L4o] 


C41] 
[42] 
[43] 
C44] 
C45] 
C46] 


[47] 


[4s] 
C49] 


[50] 
[54] 
[52] 
[53] 
C54] 
[55] 


[56] 


100 


Regression 


>0 

MATNEND:'THE CORRELATION MATRIX OF REGRESSORS IS:!' 

CRM 

'THE VECTORS OF MEANS,Y, X1, X2, X3, ...... Is;! 

MS 

'THE REGRESSION COEFFICIENT ESTIMATES ARE IN ORDER CONST. ; 


bE 

'THE CORRESPONDING STANDARD ERRORS ARE.' 

STDBE 

'THE CORRESPONDING T RATIOS ARE:' 

TRATIO 

'THE COVARIANCE MATRIX OF THE REGRESSION COEF, IS.'* 
COVBE 

"RSQ IS:'3;RSQ,'VAR OF ERROR TERM IS:';VARU 

'THE F STATISTIC INCLUDING THE CONSTANT TERM IS:! 
F2;'WITH';(K+1),( 1+-/WP);'DEGREES OF FREEDOM.' 
'THE F STATISTICS NOT INCLUDING CONSTANT TERM IS:!' 
F1;'WITH ';K,( 1+-/WP);'DEGREES OF FREEDOM.' 

'THIS ENDS THE OUTPUT FROM MREGRESS.' 

>0 

PREMEND1:'NO. OF OBS (N) IS TOO FEW RELATIVE TO NO! OF 
REGRESSORS (kK) .! 

"ROUTINE TERMINATED! 

V 
Y 

106 107 120 110 116 123 133 137 


13.4 Creation of a Regression Routine 257 


111 113 
115 103 
120 102 


124 103 


126 98 
[These data are from Johnston, Econometric Methods, p. 147.] 
Y MREGRESS X 
THE RAW MOMENT MATRIX OF REGRESSORS IS: 
145571 107690 
107690 101772 
THE COVARTANCE MATRIX OF REGRESSORS IS: 


“4112 648 
THE CORRELATION MATRIX OF REGRESSORS IS: 


1 0.17257 
“0.17257 1 
THE VECTORS OF MEANS,Y, X1, X2, X3, ...... Ts: 


116.89 113 106 
THE REGRESSION COEFFICIENT ESTIMATES ARE IN ORDER CONST. ,X1,X2,...: 
“49,341 1.3642 0.11388 

THE CORRESPONDING STANDARD ERRORS ARE. 

24.061 0.14315 0.14337 

THE CORRESPONDING T RATIOS ARE: 

“2.0507 9.5299 0.79429 

THE COVARIANCE MATRIX OF THE REGRESSION COEF. IS. 


5.7893E2 ~?2.6911F0 ~2.5792E0 
~2.6911F0 2.04938 2 3.54208 3 
~2.5792EF0 3.54208 3 2.0556F 2 


RSQ I15:0.9385VAR OF ERROR TERM [S5:12.924 

THE F STATISTIC INCLUDING THE CONSTANT TERM IS: 
3202.2WITH3 6DEGREES OF FREEDOM. 

THE F STATISTICS NOT INCLUDING CONSTANT TERM IS: 
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45.782WITH 2 6DEGREES OF FREEDOM. 
THIS ENDS THE OUTPUT FROM MREGRESS. 
Now let’s try the oddball cases. Consider: 
@<+1 2 3 
XX*34 9123456789 10 11 12 
@ MREGRESS XX 
NO. OF OBS.(N) IS TOO FEW RELATIVE TO NO. OF REGRESSORS(K). 
ROUTINE TERMINATED. 
Q<15 
XxX*5 3912315618941 1014111 12 13 
@ MREGRESS XX 
DOMAIN ERROR 
MREGRESS( 14] CRM<D+.xCM+.xDdi( ((1K)°.=1K)xCM)*0.5 
>+PREMEND 
THE RAW MOMENT MATRIX OF REGRESSORS IS: 
5 S7 42 
37 337 374 
42 374 416 
THE COVARIANCE MATRIX OF REGRESSORS IS: 


0 0 0 
0 63.2 63.2 
0) 63.2 63.2 


ROUTINE PREMATURELY ENDED DUE TO SINGULARITY OF X MATRIX. 


13.5 Bells and Whistles Section 


In this section we will discuss some ways in which this simple multiple 
linear regression model can be applied to a wider variety of circumstances 
than might at first appear to be the case. 


Transformed Variables 


The first way in which the linear model can be extended is to broaden the 
concept of the regressor matrix used in the regression routine. Let us 


Transformed 
Variables 


Heteroskedastic 
Models 


Lagged Variables 
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suppose that you have a number of arrays called, say, X1, X2, X3, and so 
on. The arrays might represent time series of interest rates, net national 
product, consumption expenditures, population, consumer price index, 
money supply, etc. We might call these the raw data. In the regression 
model we have derived from other considerations, we may want to use not 
the raw data but functions of them, such as 


In(xjj), C78, x9, Xig/Xnjn (Kis + Ata) iss HiG% 5)? 


and so on, where x,;, x;; represent the jth observation on variables x; and 
x;. These transformations are easily handled in APL. For example: 


(¥+10*5) MREGRESS &3 NOBS p(X12X2, @®X3, (X3+X4)+X5) 


enables one to transform variables by arrays and set up the regression 
without having to define a new regressand vector or a new regressor ma- 
trix; and all of this is done in one statement. NOBS is the value of the 
number of observations. 


Heteroskedastic Models 


One important special case of these ideas is the concept of weighted 
regression which frequently arises in heteroskedastic models. These are 
models wherein the covariance matrix of the disturbance term is a diagonal 
matrix with uzequal diagonal elements. Thus, suppose that the variance of 
the ith disturbance term, i = 1, 2,..., N, there being N observations, is 
o?z;, where o? is an unknown scalar and z; is a known constant. Then, if 
the ith observations on the regressand and the regressors are divided by 
/z, the transformed model meets the usual multiple linear regression 
model assumption that the covariance matrix of the disturbance term array 
is o7J, where / is the identity matrix. (For further details about such models 
see Kmenta in the bibliography.) This transformation is performed without 
difficulty in APL. 


Lagged Variables 


The next problem is a little trickier. Suppose that we have three arrays 
labelled Y, X1, and X2 representing time series on three variables of inter- 
est. Now, as frequently is the case in econometric analysis, you want to 
carry out your analysis not just with the original three arrays, but with 
arrays created by /agging the original arrays. An associated problem is that 
the lagging process will reduce the usable length of our arrays. Let’s con- 
sider a lag of length M M can be 1 or 2 or any positive integer less than 
(N — K — 1), where N is the number of observations (or length of the 
arrays) and K is the number of regressors to be used in the regression 
analysis. 

With lagging, the regressors will include not only X1 and X2 (the original 
regressors), but also the lagged values of Y, X1, and X2. With lags of length 
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M, and N the length of the original array, the variable length of the new 
array is only (N — M). Let’s call the lagged variables YL, X1L, and X2L. 
Consider, therefore, a lag of length M: 


YL + YIP + 1N-M] 

X1L « X10P] 

X2L + X20P] 

(3 Np (Y,X1,X2)) + (M p0,(N-M) p 1)/34N 9 (Y,X1,X2) 


The reader might recall that here we have used the dyadic form of /, called 
compression. Essentially, we have dropped the first M@ elements of the 
arrays on the right. 

We now have six arrays of length (N — M), the former three of 
which— Y, X1, X2—contain observations from the (M + 1)st to the Nth, 
and the latter three— YL, X1L, X2L—contain the first (N — M) observa- 
tions from Y, X1, and X2, respectively. Just to check these ideas out, let’s 
try it, using the Y array and X matrix defined earlier. Let M be 2. Thus we 
key in: 


Y 
100 106 107 120 110 116 123 133 137 


x 
1 100 100 
1 104 99 
1 106 110 
4 “babe, 26 
qa 413 
1 115 103 
1 120 102 
1 124 103 
1 126 98 

ox 
9 3 

Me2 


XL+Y[P+iN-M] 

XE+X[P 31 2 3] 
Y+((M p0),(N-M)p1)/¥ 
X((Mp0).(N-M)p1)4X 


Durbin-Watson 
Statistic 
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Y 


107 120 110 116 123 133 137 


x 
1 106 110 
1 111 126 
1 111 113 
1 115 103 
1 120 102 
1 124 103 
1 126 98 


A comparison of the new arrays Y and YL and the matrices X and XL will 
show that we have accomplished our objective. Examine the use of pa- 
rentheses very carefully in the expression ((/@p0) , (W-M)p1). 


Durbin- Watson Statistic 


Now consider a useful test statistic in linear regressions involving time 
series—the Durbin-Watson statistic (see, for example, Kmenta, pages 
295-97). From MREGRESS we get a globally defined array called UH, 
which contains the forecast error terms. Mathematically, the Durbin- 
Watson statistic is defined by 


d = XL.(e, — e-,)?/ tie? 
where e, = (y; — y,). An obvious APL solution is 
A <+/UH*2 
D +(+A)x+/(((0,(7-1) p1)/UH)-(( 7-1) 91,0)/UH)*2 
Try it on the UH generated by MREGRESS: 
VD<DURWAT X 
[a] T1«(pXx)-4 
[2] A<+/Xx2 
[3] De(#A)x+/(((0,71 p1)/X)-((171 p1),0)/X)*2 
[4] ‘THR DURBIN-WATSON TEST STATISTIC WITH ';T1;' DEGREES 
OF FREEDOM IS:'D 
Vv 
Y MREGRESS X 
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The computer output for MREGRESS has been suppressed to save space 
(see page 257). 


DURWAT UH 
THE DURBIN-WATSON TEST STATISTIC WITH 8 DEGREES OF FREEDOM TS: 


1.635297783 


Summary 
For a matrix X of N observations and K variables, the following 
statistics can be defined: 
Moment Matrix a 
Covariance Matrix X’X -X'X — | 
Correlation Matrix D°'(Xx'X — X'X)D~'?, where D contains the 
diagonal elements of (X’X — X'X) 
[See page 241 for the APL expressions. ] 
Sample Correlation Coefficients: 
ry = Cov(x;, x;)/ V Var(x,) Var(x)) 
where Cov(x;, x;), Var(x;), and Var(x,;) are sample values. 
Sample Partial Correlation Coefficients: 
iw = (ty — Pinta) VO = rR = 7) 
For regression statistics, see Table 13.1. The corresponding APL state- 
ments are contained in MREGRESS on page 254. 
Durbin-Watson Test Statistics: 
d = Zale; ~ €-1)?/2ie? 
where e, = (Y, — Y,). 
The APL routine is listed on page 261. 
Exercises 


APL Practice 


1. Suppose that you have an array ZZ. How would you instruct the 
computer to form a diagonal matrix D with diagonal elements those of 
ZZ? Specify an array ZZ and check your routine. 


2. Write a simple routine called CORRMAT, whose output will consist of 
the correlation matrix of the regressors with their respective names, 
i.€.! 
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THE CORRELATION MATRIX OF REGRESSORS 


NAME dd KK LL 
Jd * * * 
KK * * * 
LL * * * 


Have the program ask you for the names of the variables first and then 
print the correlation matrix. 


3. Write a program that will calculate the partial correlation coefficients 
of a set of regressors X. 


4. Write a simple routine that will give you the regression coefficients in 
the following format: 


ESTIMATED REGRESSION COEFFICIENTS 
STANDARD ERROR T-STATISTIC 


CONSTANT * * * 
COEF. OF X1 * * * 
COEF, OF X2 x * * 


Furthermore, if one of the f-statistics is smaller than 1.96, print a 
message to state which coefficient is not significantly different than 
zero at the 5% significance level. 


5. Another way to calculate the Durbin- Watson statistic is given by the 
formula 


where i is a vector of estimated error terms and 


i -l 0 0. 
-1 25) = er 0 
0 -! 2 -1 


0 oO -!1 1 


Write a short routine to calculate the Durbin-Watson statistic using this 
matrix formulation. 
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For the following list of residuals calculated as UH = 1, —1, 2, 3, —4, 


—5, 1, 3, find the Durbin-Watson statistic using the above formula and the 
DURWAT H function (page 261). Do you get the same result? If not, why not? 


Statistical Applications 


The questions in this section utilize the Data Set Watt in Appendix E. 


I. 


Consider the following one equation production model 
Y= by + bX, + bX, + U 


where Y is production in KWHR of 15 electricity-generating plants, 
X, is the input in BTU’s per firm, and 


{; if plant’s number is odd 
XxX, = 


2 if plant’s number is even 


Test the following pair of hypotheses using only the second year data 
(1967) 


A: Do = 0, by # 0 
and compare to: 


Hy: b, = 0, bp unrestricted . 
b. against 
H,:. 0b. # 0, by unrestricted 


(Hint: The column of 1’s should be used.) 


c. Does your answer regarding the importance of X, in the 
model differ between tests (a) and (b)? 


Suppose that the manager of the second power plant knows that one 
of the following two models is the true model that explains the factor 
demand for energy. The two models are 


model A Y, = dy + A,X; + QP, + Uy, 
model B Y, = bX,P, + Us, 


where Y, is input in BTU’s for year ¢, P, is the price per BTU paid by 
the second plant for year t, X, is the production in KWHR for year f, 
and the standard assumption that the distributions of U;,, i = 1,2 are 
given by: 

Uy, ~ N(O, 7) 

Ue, ~ N(0, a3) 


Show that model A is the true model. (Hint: combine the two models 
and test the coefficient on the X,P, term.) 
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3. The purchasing department of the sixth plant has the following model 
to estimate the factor demand relationship: 


Y, = dy + aX, + G2P, + a3D + U, 


where Y, is the quantity of energy utilized in BTU’s for year ¢, P, is 
the price per BTU paid by the plant in year r, X, is the output in 
KWHR for year ¢, and D is a dummy variable which takes on the 
value one if summer temperature exceeds 90°F, and the value zero if 
the temperature is less than 90°F. The summer temperatures are: 


(a) Find the estimates of the coefficients and the estimates of their 
covariance matrix. 


(b) Drop the 1975 observation. Notice that the matrix of regressors is 
now singular. However, one can obtain estimates of these pa- 
rameters d,, ad), and a*, where u* = a) + ay; that is, drop the 
variable D. What are the estimates of a,, a), and a*? Determine 
the covariance matrix of a,, a,., and a*. Compare estimates of a, 
and a, with your answers to (a). Use the appropriate column of 
the data set WATT in Appendix E. 


4. Suppose the Department of Energy wanted to know the production 
relationship between BTU and K WHR. A model that they might use 
employs averages for the variables of the industry. The model could 
be 


Y, = ay + aX, + aP, + U; 


where ¥, is the average output (in K WHR) of the plants in year ft, X; is 
the average input in BTU’s to the plants in year ¢, and P, is the 
average price per BTU paid by the plantsin yeart. Heret=1,. .. ,10. 
Estimate the coefficients of this model. 

Estimate the covariance matrix of the coefficients. 

Use data set WATT of Appendix E. 


5. The manager of the fourth plant conceived a more sophisticated 
model for his input demand. He believed that his input demand for 
fuel is explained by a partial adjustment model. The model is 


(i) YF = by + bP, + bX, 
(ii) Yo Ya SMP — Yay bY 


where Y,is BTU for year t, P, is the prices for year t, X, is KWHR for 
yeart, and Y* is the unobserved desired Y,. As before,t=1,. . . , 10. 
Using data set WATT of Appendix E: 
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(a) Show that the input demand equation is equivalent to 
Y, = boy + beyX, + (1 — y)¥i-1 + byP, + U; 


(b) Run an OLS and obtain the estimates of the coefficients. (Assume 
that Yj9,, = 700 x 10'? BTU’s). 


In the model of problem 5, notice that the first equation is determinis- 
tic; i.e., it has no error term. Suppose that the manager believed 
instead that the following model was correct: 


(i) Y# = by + DP, + bX, + U, 
(ii) Y,- Y4= y( Ve — ¥i4) 


(a) Solve for Y, as you did in part (a) of problem 5. 


(b) Estimate the coefficients using the least squares procedure dis- 
cussed in this chapter. 


Suppose that you hypothesize that current production of KWHR 
depends on current prices of the input (BTU) and the previous year’s 
prices of the input as well. Your model now becomes Y, = dy + a,P; + 
a,P,, + U, for the first plant. Test the hypothesis that your model is 
the correct model against the alternative, that KWHR depends on 
BTU’s consumed and current prices, written as Y, = dy + a,X; + oP; 
+ U, is the correct model, using data set WATT of Appendix E. 


In this application you will extend the idea of adding lagged prices to 
the demand function as was done in the previous exercise. Suppose 
that you hypothesize that it takes two to three years to adjust output 
to price changes in inputs. Formulate your model and test the hy- 
pothesis that the adjustment period is three years, using data from 
the first plant. The two models can be written as: 


(i) Y, = ay + ayPy + G2Py-1 + A3Py-2 + a4Pi-3 + Ure 
(ii) Y, = bo + bX, + boP, + Ug 


where the variables are defined in question 2. 


Consider the following model: 


Yin = a+ DXi + Vir i=1,..., 15 


where Y;,is the 10 by 1 column vector of KWHR of firm over the 10 
years, X;, is the 10 by | column vector of BTU’s used by firm/, and 
over the 10 years b;, is the coefficient relating input to output. Test 
the following two hypotheses: 


(a) The coefficient b;, is the same for all of the 15 plants, although 
they may differ over time, i.e., b;, = b, for all i. 

(b) The relationship between X and Y remains constant over time, 
but it might differ across firms, i.e., b;, = b; for all ¢. 
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10. If you have already found that the functional form of the relationship 
between prices of BTU’s and BTU’s demanded by the whole industry 


is 
BTU = a)P“e" 


(a) Estimate ap and a,. 
(b) Find the price elasticity of demand for inputs for the entire indus- 


try. 


14 


Other Simple Regression 
Equation Estimators 


You will not study econometrics very long before you discover that the 
linear multiple regression model we discussed at length in the previous 
chapter cannot handle all eventualities, and so new estimators have to be 
created. This chapter will discuss a number of these. 


14.1 Simultaneous Equation Models 


Simultaneous 
Equation Models 
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One of the most important extensions of the linear regression model is 
based upon the recognition that in most situations in economics we cannot 
consider a single regression equation in isolation, but instead must treat it 
as merely one equation within an interdependent system. In short, in this 
chapter we begin by introducing the notion of simultaneous equation mod- 
els. For an excellent introduction to this fascinating area of econometrics 
see the books by Kmenta or Johnston (listed in the bibliography). 

The most usual simple example is to consider the market for lemons (the 
fruit, not inferior cars or appliances!). If you have studied only a little 
economics you will be aware that the price and quantity of lemons ex- 
changed in the market are the result of the interaction between two behav- 
ioral relationships, one determining producers’ supply responses and one 
determining consumers’ demand responses to market prices. 

Now the statistical problem we face if we want to use observed data to 
estimate either response, or even both responses, in that we cannot blithely 
run a regression of, say, quantity traded, on price and other variables 
affecting demand and expect to get useful estimates of the parameters of 
the model. Besides, we might immediately wonder whether we should 
really be regressing price on quantity and other variables or quantity on 
price. 

This is not the place for us to settle these issues since many books, 
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indeed several very good books, have been written on the subject. We will 
proceed under the assumption that you have read or are reading a text- 
book on estimation in simultaneous equation systems. 

The estimation problem in a simultaneous equation system arises from 
the distinction between endogenous and exogenous variables. So far in 
linear regression the distinction has been straightforward—the dependent 
variable is the endogenous variable and the regressors on the right-hand 
side of the equation are the exogenous variables. Endogenous variables are 
determined by the set of equations being considered and exogenous vari- 
ables, while determined outside the system, affect the endogenous vari- 
ables through the equations. The problem in a simultaneous equation sys- 
tem is that we have more than one endogenous variable in each equation! 
The regression techniques that were suitable for the single endogenous 
variable are not suitable for the many endogenous variables case. New 
techniques are needed. We now address ourselves to one of these 
cases. 
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2SLS Regression 
Procedure 


One method of estimation for a single equation that is frequently used with 
simultaneous equation systems is called two-stage least squares (2SLS). 
Here is an example of a simplified model for food consumption from 
Kmenta, pages 563-656. 


QO, = a, + QP, + azD, + Uy (demand) 
Q, = b, + bP, + b3F, + byA, + Uy (supply) 


where Q, is food consumption per capita, P, is the ratio of food prices to 
general consumer prices, D, is disposable income in constant prices, F; is 
the ratio of the preceding year’s prices received by farmers for products to 
general consumer prices, A, is time in years, and Ug and U,, are the 
disturbance terms. The data are presented on page 285. 

In simultaneous equation systems we are usually interested in what are 
known as the structural coefficients, that is, the coefficients relating the 
effect of, say, income, on quantity demanded or price. The demand and 
supply equations shown above are structural equations. Alternatively, we 
can consider solving for each endogenous variable in terms of all the exog- 
enous variables. These latter equations are called the ‘‘reduced form’’ 
equations because the prediction of each endogenous variable has been 
reduced to a linear function of exogenous variables only. Of course, the 
reduced form coefficients are in general complicated functions of the struc- 
tural coefficients. For example, the reduced form equations for the above 
simple model are 


QO: = Yo + yiDi + yoF + ysAr + Vie 

P, = 8y + 8,D; + 8oF; + 83A, + Voy 
and the reduced form coefficients expressed as functions of the structural 
coefficients are 


270 


Other Simple Regression Equation Estimators 


_ bea, — azb, _ b2Qs 
Yo by — ae Nn by — az 
_ —Agbz _ —Azb, 
Y ‘by — ay %3 ~~ by — ae 
_ by ~ a _ 7 a3 
bo = by — ay ab a, 
_ __ bs _ bs 
oo = Ba 83 = Ba 
bLU, a,U, Og — U, 
Ve= = be - a, Vor = by ~ a, 


In this simple model Q, and P, are the endogenous variables and D,, F,, and 
A, are the exogenous variables. 

The 2SLS procedure starts with estimates of the coefficients of the re- 
duced form equation, wherein each endogenous variable is a function of all 
the exogenous variables in the system. The second stage is to use predicted 
values of the endogenous variables as regressors in the regression equa- 
tion, relating one endogenous variable to the others in the structural equa- 
tion. These first round coefficients in the price equation are computed by 


V+eP F1,D,F,A 


The only thing new here is the catenation of D, F, A and a column of 
1’s, which is a simple extension of the way in which we used catenate 
earlier. Recall that the computer operates from right to left. You might 
type 1, D, F, A to see the matrix in full. 

Next we compute the ‘‘estimated’’ or ‘‘fitted’’ values of the P list by 
multiplying each of the coefficients by the appropriate data value. For the 
first sample point we have 


99.628 = 90.278 (1) + 0.663 (87.4) — 0.488 (98.0) —0.737(1) 


In order to compute the vector of all fitted values we can use matrix 
multiply 


(1,D,F,A) +.x V 
and to store the result in a 20 by 1 vector we would write 
W+ 20 1» (1,D,F,A) +.x V 


You may wish to type VV and display the 20 fitted values. This completes 
the first stage of the procedure. 

The second stage is to estimate the structural equations using the fitted 
values of P (stored as VV) in place of the original values of P. For the 
demand equation, this is accomplished by 


O1,VV,D 

94,63333 These are the 2SLS estimators 
of the structural coefficients 

0.24355 Q,, Ao, Ay. 


0.31339 
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The structural equation is 
O, = 94.63333 — 0.2436 P, + 0.3140 D, 
Similarly, the supply equation is 


Qelt, VV, F, A 


49 .53244 

0.24007 These are the 2SLS estimates of the 
° structural coefficients b,, b,, bs, bg. 

0.25561 

0.25293? 


or 


O, = 49.53244 + 0.24007 P, + 0.25561 F, + 0.25292 A, 

You may have noticed that the supply equation is exactly identified; that 
is, the number of excluded exogenous variables from that equation is equal 
to the number of endogenous variables included in the regression as regres- 
sors. The excluded exogenous variable is D, and the included endogenous 
variable is P,. 

In the exactly identified case we can derive estimates of the structural 
coefficients directly from the estimates of the coefficients of the reduced 
form equation. This is usually an exercise in algebra, where the values of 
the structural coefficients to be estimated are obtained by solving the equa- 
tions relating the structural and reduced form coefficients in the exactly 
identified equation. To accomplish this we can estimate the reduced form 
equations for P and Q. 


QH1,D,F,A PH1,D.F.A 


71.20355 {The estimates of 90.26776 


the reduced form The estimates of the 


0.15922 . 0.66321 reduced form coeffi- 
coefficients yo, : 
cients 6), ... , 53. 
0.13834 sy Ys- -O.48845 
0.07598 -0.737039 


Algebraically, we have 
O = 71.20355 + 0.15922D + 0.13834F + 0.075978A 
= B, + B.(90.26776 + 0.66321D — 0.48845F — 0.737039A) 
+ B,F + BA 
After some algebraic manipulation we obtain 


_ 0.15922 
2 ~ 0.66321 


Bz = 0.13834 — (0.24007)(—0.48845) = 0.2556 


etc., which are in fact our 2SLS results. The other coefficients are left as an 


= 0.24007 
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exercise for the reader. It should be clear that even when the equations are 
exactly identified, 2SLS is a much simpler procedure than is indirect least 
squares. Of course, when the equation is overidentified (more excluded 
exogenous variables than included endogenous ones), you cannot use the 
indirect least-squares procedure, but you can use 2SLS. 

As a final example, you may want to see what results you would have 
obtained if you had performed ordinary least squares (O.L.S.) on the 
structural equations. (Remember that these estimates are statistically in- 
consistent.) 

The demand equation would be 


QH1,P,D 
94.63333 
99.89542 — . 
Dre sor ae ce timates 0.24355 
-~0.31630 e 2SLS estimates> 
0.31399 
0, 33464 
and the supply equation would be 
QH1,P,F.,A 
49,53244 
58.27543 
, <The O.L.S. estimates 0.24007 
0.16037 : 
The 2SLS estimates—> 0.25564 
0.24813 
0.25292 
0.24830 


The result of this section is that you can use APL to compute estimates 
that have desirable statistical properties. The procedures are not difficult. 
Even in the case of exactly identified equations, 2SLS using APL is easier 
than the indirect least-squares approach. Two-stage least squares for the 
system we used requires two additional APL expressions over ordinary 
least squares, and that technique yields inconsistent estimates. 

An alternative, but related, estimator that we will develop is the instru- 
mental variable estimator. It is related to 2SLS in that the fitted values 
from the reduced form which we computed can serve as ‘‘instruments’’ for 
the endogenous variables. 
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Instrumental 
Variables 


In those situations in which the conditional distribution of the disturbance 
term is not independent of the regressors for whatever reason (some exam- 
ples of this problem are ‘‘errors in variables’’ and simultaneous equation 
models), a useful approach to obtain estimators with desirable statistical 
properties is that of instrumental variables. More formally, but easier to 
understand, consider the model 


Y = xX B + U 
(N x 1) (Nx K) (Kx DD (Nx ID 
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where the dimensions of the arrays are given in parentheses below each 
array. The main difference between this model and that of Chapter 13 is 
that here we assume that the distribution of the disturbance term U is not 
independent of X, while in Chapter 13 we did assume such independence. 

Now if (and it is a big IF), we can find another matrix Z of dimension 
(N x K) such that U is independent of Z and the columns of A have 
positive coefficients of determination with the columns of X, then Z is said 
to be a matrix of instrumental variables for X. You might care to remember 
that the nonindependence of X and U may be caused by only one column of 
X, say x,, in which case the variables x.,.. . , x, can “‘serve as their own 
instruments.”’ 

Well, if we have an instrumental matrix Z, how do we use it? The easy 
way to do it is this: transform the regression Y = X¥B + U to 


ZY=2XB+ZU 
and now carry out an ordinary least-squares regression of Z'Y on Z'X 


where, with proper choice of Z, Z'X,a(K x K) matrix, is nonsingular. The 
mathematical solution is 


BE (the estimatcr of B) 


(Z'X)'Z'Y 
B+ (2'X)"'Z’U 


il 


The corresponding covariance matrix of the regression coefficient es- 
timators is defined by 


(ZX) MZIZ)(XIZ) 


where o” is the variance of the disturbance term U. The required APL 
expressions are 


BE+( (QZ) +.xY EC ZX«(QZ ) +. xX) 
VARU*( +/( UH«( Y-X+.xBE))*2)4-/(pX) 


COVBE*VARU&x (ZX) +. xCQZ+..*Z) +. x Q2ZX) 


In order to illustrate the use of instrumental variables, we will write a 
very simple and straightforward routine to do the necessary calculations, 
but we’ll omit all the frills that we put into MREGRESS; these you can add 
later. Consider 


VBE<INSTRVAR 
[i] BE+((QZ)+.*xY)BCZX<+(QZ)+.xX) 


[2] VARU<(+/(UH+( Y-X+.*BE))*2)+-/(pX) 
C3] COVBE<VARUx(HZX)+.*((QZ)+.*Z)+. xH(QZX) 


fu] Vv 
We will check the routine with some data cited by Kmenta, page 313. 
Y 


0.768 0.433 0.4575 0.5002 0.3462 0.3068 0.3787 0.1188 0.1379 
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“0.2001 “0.3845 

x 
3.5459 3.2367 3.2865 3.3202 3.1585 3.1529 3.2101 2.6066 
2.4872 2.428 2.318 

Z 
3.4241 3.1748 3.1686 3.2989 3.1742 3.0492 3.1175 2.5681 
2.5682 2.6364 2.5703 


X+ &2 119 ((11 p1).%) 


x 

1 3.5459 
1 3.2367 
1 3.2865 
1 3.3202 
1 3.1585 
1 3.1529 
1 3.2101 
1 2.6066 
1 2.4872 
1 2.428 

1 2.318 


Z-Q2 1ip(( 141 e1).4) 


Z 
4 3.4244 
1 3.1748 
1 3.1686 
4 3.2989 
4 3.1742 
4 3.0492 
4 3.1175 
1 2.5681 


1 2.5682 
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1 2.6364 
1 2.5703 
INSTRVAR 


“2.297911012 0.8435302294 
VARU 
0.002081788327 
COVBE 
0.01070128568 ~0.00353069422 
~0.00353069422 0.001185860302 


14.4 Aitken’s Generalized Least Squares 


The next estimator that we consider is Aitken’s generalized least-squares 
estimator. Suppose that the model we want to estimate is 


Y= XB+U 


and the distribution of U has a null mean vector (of dimension N) and a 
covariance matrix & which is of dimension (N x N) and, in general, has 
nonzero terms off the diagonal. Suppose = is known. If so, an appropriate 
estimator in such a situation is Aitken’s generalized least-squares es- 
timator. It is defined mathematically by 


B= (XD UX) 'UXDIY 
with corresponding covariance matrix 
(X' 271 X)" 
A routine that calculates this estimator is very simple. Some suitable 
data are 


Y 
160 160 180 200 210 220 230 250 200 220 230 300 310 340 350 
300 400 450 540 
x 
2000 2000 2000 2000 2000 2000 2000 2000 4000 4000 4000 4000 
4000 4000 4000 6000 6000 6000 6000 6000 
SIG is a list whose elements are the values of the diagonal elements of > 
SIG 
0.425 0.125 0.125 90.125 0.125 0.125 0.125 0.125 0.1428571429 


276 


Aitken’s 
Generalized Least 
Squares 


Other Simple Regression Equation Estimators 


0.1428571429 0.1428571429 0.1428571429 0.1428571429 
0.2428571429 0.1428571429 0.2 0.2 0.2 0.2 0.2 

These data are from Kmenta, page 259. 
ZQZe2 20 pZey,Xx 
Z 

160 2000 

160 2000 

180 2000 

200 2000 

210 2000 

220 2000 

230 2000 

250 2000 

200 4000 

220 4000 

230 4000 

300 4000 

310 4000 

340 4000 

350 4000 

300 6000 

300 6000 


400 6000 


450 6000 
540 6000 


A useful routine for calculating Aitken’s generalized least squares is 


VATTKNGLSLUIV 


VBE«Z, ATTKNGLS SIG 
[1J ¥<2031] 
£2] X*Z03(1(92Z)02)-1)+1] 
[3] WN+NxN« 9 STG 
fu] (N,N)p (NN 01,N 00)\STG 


Regression with 
Restricted 
Coefficients 
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[5] BE(COVBEHEH(Q X)+.x€ BH SIG)+.xX)+.x(QX)+.x( SIG) +.x¥ 
Vv 


Z AITNKGLS STG 
0.073151785714 


The only bit of APL programming which is a little different is in line [4]. 
What this line does is to produce a diagonal matrix with the elements of the 
list SJG on the main diagonal.(V NV p 1,N p 0)produces a list of length N? 
of 1’s and 0’s such that when it is used to expand SIG the N? elements are 
the rows of the required diagonal matrix written out in a list. 

The Aitken’s generalized least-squares estimator is mathematically 
equivalent to the estimator obtained by ordinary least squares on 


PY=P'XB+ P'U 


where P’ is an N X N matrix which satisfies P’{P = J. Such a P can 
always be found when * is nonsingular. 

Another linear regression situation for which a different estimator is 
required occurs when the regression coefficients are known to satisfy cer- 
tain linear constraints. The model is 


Y=XB+U 
r= RB 


where the (Q X 1) vector r is known, as is the (Q x K) matrix of coeffi- 
cients R. In this situation the mathematical expression for the restricted 
least squares estimator is 


BER = BE + (X'X) R'[R(X'X) 1R'}(r — (R)BE) 


where BE is the unrestricted ordinary least-squares estimator and BER is 
the restricted least-squares estimator. If V represents the covariance ma- 
trix of the unrestricted estimator, then the covariance matrix of the re- 
stricted estimator is given mathematically by 


COVBER = V — VR'(RVR') 'RV 


Expressing these functions in APL is, of course, by now not difficult. For 
example: 


VBERCRESLS 
[iJ] B<YElX 
C2] QHeR+. x( XIX) (QX)4+.xX) x. xQR 
(3] BER<B+XIX+.x (QR)+.xQ+.x (R2-R+.xB) 


[4] VARUH+(+/(¥-X+.xBER)*2)+ (-/(pX)) 


[5] COVBER<VARUx( XIX-XIX+.x(QR+.x(HR)+.xXIX+. xR) +. xh+.xXTX) 


Let’s use the following data to try the above function: 
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100 106 107 120 110 116 123 133 137 


x 
4 4 1 
1 1.04 "0.99 
4 1.06 1.1 
1 1.4114 1.26 
1 1.11 1.13 
1 1.15 1.03 
1 1.2 1.02 
1 1.24 1.03 
1 1.26 0.98 

R 
100 
110 

f2 
10 

RESLS 


1 71 109.6135479 
VARU 


373.1395307 


COVBER 
2.651315829F'12 ~3.976973744F 12 0.000000000F0 
~2.651315829F 12 4.391241842R 12 “1.988486872E 12 
0.000000000F0 “1.077097056E 12 3.666426234E1 


14.5 Durbin’s Estimator in First Order Autoregressive Models 


All of the estimators so far have been defined in terms of some linear 
transformation of the regressand vector Y; i.e., all the estimators have been 
of the type 


B=AY 


where A is some suitable matrix of Known constants. 


First Order 
Autoregressive 
Model 
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We come now to some estimators which are no longer linear in this sense 
and which involve iterated solutions. The basic idea is that we start with 
some reasonable idea for A, say A°, and from that we get 


B' = A°Y 
We use B' to calculate an A', from which we get 
B? = A'Y 


and so on. Usually, it is convenient and statistically justified to stop after 
obtaining B?. The iterated nature of such estimators is sometimes not clear 
from the formulas in the textbooks, especially when one can obtain B? as a 
function (through A') of A° and Y directly. Our first example of such an 
iterated estimator is in the context of a time series model. 

The model we are considering is 


Y=XB+U 


where Y is (T X 1), Xis (T x K), Bis (K xX 1), andU is (T x 1). The model 
is similar to our previous models except that the covariance matrix of U is 
given by: 


1 p p p 
p 1 pt? 

V = ao . . = o> 
p™! p™? ee ee ee we 1 


where pis the autocorrelation parameter, ~1 < p < 1; 1.e., we are assum- 
ing that U, = pU;., + e,, where e, is normally distributed with zero mean 
and constant variance, and e,, e, for t # s are statistically independent. 

There are several methods of estimating a model of this type when p is 
unknown. (Of course, if p is known, then Aitken’s generalized least 
squares, which was discussed above, can be used.) One of the best meth- 
ods is a procedure due to Durbin (see, for example, the discussion in 
Johnston*). The regression model can be transformed to 


Y,; = Y,-1p + XB _ X,-1B p + E, 


where Y, denotes the array Y with the first observation deleted, Y,_, de- 
notes the array Y with the /Jast observation deleted, YX, is the X matrix with 
the first row deleted, and X,_, is the X matrix with the last row deleted. 
Fis the error term. There are now only T — | observations. The regres- 
sion is a two-step procedure. 


First Step: Use ordinary least squares to get an estimate r for p in 
Y= Yiip + X,B-— X,,Bp + E; 

Second Step: (i) Transform variables to get (Y, — rY,_,), (X; — rX:-1)3 
(ii) Use ordinary least squares on the transformed vari- 


* J. Johnston, Econometric Methods, McGraw-Hill, New York, pp. 192~199. 2nd Ed., 1972. 
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ables to get an estimate for B; i.e., use (Y, — r¥,-,) = 
(X, —_ rX,_)B + E,. 


In terms of the APL programming of the routine, we need to obtain the 
‘‘lagged’’ arrays Y,_1, X;-1, perform an ordinary least squares (OLS) regres- 
sions, transform variables, and do another OLS regression. Consider the 
following effort: 


VBE<Y AUTOREG X 
[1] ¥Ee(L4e((((@ ¥)-4)e1),0))/y {Mula lagsing of variables in 
lines [1] to [4]. 
[2] Ye ( L2(0,((p ¥)-1)p 1))/¥ 
[2.2] K+2+ (p X)L2]x2 
[3] XI<DA7X 
[4] X<L24X 
[5] BE<YE) (YL, ((pYL) 01),XL,X) (First step regression, line [5]. 
[el] Repel 1] 


[7] Y<( Y-RxYL) {igre variables, lines 
[6] to [8]. 
[8] &* ( X-RxXL) 


[9] BE*YEX {Second step regression [9]. 
[.5] aX MATRIX TS ASSUMED TO BE WITHOUT CONST. 
[10] Vv 


Let’s try an example. In order to see the difference between the esti- 
mates, calculate YEIX with the original variables. First of all, you might wish 
to check that (YZ,((p YZ)p 1),XL,X) gives the expected matrix. Let’s 
reuse some familiar data. 


Y 
11.06 4.07 1.2 1.1 1.16 1.23 1.22 1.27 


po Y 
9g 

x 
1 1 
2.04 0.99 
1.06 1.1 
1.11 1.26 
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1.15 1.03 
1.2 1.02 
1.13 1.03 
1.26 0.98 
ox 
9 2 
YL 


11.06 1.07 1.2 1.1 1.16 1.23 1.33 


Y 
11.06 1.07 1.2 1.1 1.16 1.23 1.33 1.37 


xD 
1 1 
1.04 0.99 
1.06 1.1 
1.11 1.26 
1.41 1.13 
1.15 1.03 
1.2 1.02 
1,24 1.03 

L1 


1diaa41414110 
L2 
0111411414141 
(YL, ((o YL) p1),XL,X) 


1 1 1 1 

1.06 1 1.04 0.99 
1.07 1 1.06 1.4 
1.2 1 1.11 1.26 
1.1 1 1.11 1.13 
1.16 1 1.15 1.03 


1.23 1 1.2 1.02 


24 
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The result from the routine AUTOREG is simple: 
Y AUTOREG X 
1.16889455 0.1677267818 


14.6 k-Class Estimators in Simultaneous Equation Systems (OLS, 2 SLS, and Limited 
Information Maximum Likelihood) 


k-Class Estimators 


The last estimator to be discussed in this chapter is in fact a class of 
estimators—the ‘‘k’’ class estimators to be precise—which occur in the 
estimation of regression models embedded within a system of simulta- 
neous equations. The linear regression model is now written in the form 


Y = Y, G, + Xy B, + U 
(Tx 1) (Tx M,) (M,x 1) (TX K,) (Kix) (Tx 1) 


where the dimensions of the arrays are put in parentheses below the ar- 
rays. Because the variables Y and Y, are jointly determined by a system of 
simultaneous equations, U/ is not statistically independent of Y,. Y, is 
known as the matrix of included endogenous variables this equation, X , is 
the matrix of included exogenous variables, and X, is a (J X K,) matrix of 
the exogenous variables which is called the matrix of excluded (from equa- 
tion 1) exogenous variables. We will make the simplifying assumption that 
K, = M,. Let X = (X,X,), a T X (K, + Ko) matrix. 

With such a model, an entire class of estimators can be defined by the 
normal equations as follows: 


[vir —kViV, vin] ea 7 ["” - rir) 
XIY, X1X,5 LBA | > XY 


where V, = ¥, — ¥,, % = X(X'X)"1X] Yj, and G,(k), B,(k) are the k-class 
estimators; for the algebraic details, see Kmenta, Goldberger, or any 
intermediate level econometrics textbook. If k = 0, one has the OLS 
estimator; if kK = 1, the two-stage least-squares estimator, and so on. The 
value of k need not be a preassigned constant, but could be a solution to a 
maximization problem involving the random variables, as, for example, in 
the case with the limited information maximum likelihood procedure. 

In setting up a program to obtain k-class estimators, let’s consider some 
changes to the approach we have taken so far. 

Let’s assume that stored in the computer is a matrix Z of dimension 
T X NV, where NV is the number of variables in the system of equations 
to be analyzed. Alternatively, and equivalently in APL, the data could be 
stored as NV arrays of length T. The routine to be defined below will be an 
interactive routine in that the routine, once it is called, will request input 
from the user and process it accordingly. 

The main steps in the calculation of the k-class estimators are: 


(i) Fix the value of k and determine which columns of Z are Y, Y,, X,, 
and X,; 
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(ii) Calculate V,; 
(iii) Set up the matrices appearing in the normal equations above; 
(iv) Obtain the estimates G,(A), B,(k). 


Consider the following routine: 


[1] 
[2] 
[3] 


V KCLASSEST;TND;2Z;X23;Y13;VA1;3Y393P 
"ENTER NO. OF ARRAYS OF VARIABLES' 
NV<] 
D1A+L2«L3<Lyu<NV pod 


"ENTER ARRAY NO. OF DEPENDENT VB L.! 


ND] 
TALND]+1 


"ENTER ARRAY NOS. OF ENDOGENOUS REGRESSOR VB LS.' 
NEND«] 
L2NEND +1 

‘ENTER ARRAY NOS. OF EXOGENOUS INCL REGRESSOR VB LS.' 


NEX+{] 
L3.NEX]<1 

"ENTER ARRAY NOS. OF EXOGENOUS EXCL. REGRESSOR VB LS.' 
NEXX<(] 


DLYULNEXX <1 
DULNEX <1 
"ENTER DATA ARRAYS ';NV;' IN NO. ' 


Ze 


ae 


(X2+L4/Z)) 


VH1<Y1-X2+.x((Y1*«L2/Z) 


'ENTER VALUE OF K FOR K-CLASS EST.' 


K+ 
Y<( ((QY1)+.x(24/Z) )-Kx(QVA1)+.*(24/Z)) ,£110(@L3/Z)+.xD1/2Z) 
Q<(((QY1)+.x¥1)-Kx(QVA1)+.xVA1),[2)((@Y1)+.*Z3/Z) 


P<((QL3/Z)+.x¥1),02]((QL3/2Z)+.xL3/Z) 
BE<( ,Y)EC@,01 IP) 


BE 
V 
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Some discussion is needed for this routine, as it appears to be a little 
complicated. However, the complications arise from only two sources: the 
desire to pick out easily which arrays are to be the dependent variables, 
which the included endogenous, and so on; and the wish to define a whole 
class of estimators at once. To see how simple two-stage least squares 
would be without the first requirement (having already removed the sec- 
ond), consider the following pair of lines: 


YH1«X2+ .x( YENDHX2 ) 


BE*<Ye\( YH1,X1) 


where YEND is the matrix of included endogenous variables, X2 is the 
matrix of excluded exogenous variables, Y is the dependent variable array, 
and X1 is the matrix of included exogenous regressors. This simple pair of 
statements generates 2SLS regression coefficient estimates if the appropri- 
ate arrays are already specified. 

Let’s return to the more complicated routine. The first step is to enter 
the number of variables, since we are going to extract Y, Y1, X1, and X2 
from Z by compression (i.e., use of /). For example, £2 is an array (of 
dimension NV) of 1’s and 0’s—1’s where the columns of Y1 are located and 
0’s elsewhere. The device V/«{], used to solicit a response from the user, 
has been discussed previously. 

In line [18] the user is prompted to insert data in the form of a matrix Z. 
You can do this by catenating the variable arrays which we have assumed 
are already stored in the computer. For example, if Q, P, R, S, W are the 
arrays needed, you type (after the computer prompts with []) 


: QVV Tp(Q@,P,F,5,W) 


where T is the numbers of observations. This action creates a matrix Z 
whose columns are Q, P, etc. 

Line [19] calculates V,. Lines [22] and [23], [24] define the arrays shown 
in the normal equations whose solution yields the required k-class regres- 
sion coefficient estimates; this is done in line [25]. 

Line [22] is a straightforward programming version of the mathematical 
statement wherein the two parts of the Y array are catenated together. 
Lines [23], [24], and [25] are a little more tricky in that we need the exten- 
sion of catenation called laminate, which we discussed in Chapter 12. The 
two-dimensional array Q contains (Y/ Y, — kV; V,, Y{X,), and P contains 
(Xi Y,, X1X1), while (Q,[1]P) yields the required matrix: 


[ Y, — kV; V, al 
XY; X1X, 


Line [25] raises another interesting little facet in APL. In the use of 8 
dyadically, the left argument must be a /ist of length n if the right-hand 
argument is a two-dimensional array of dimensions (n x q). The left argu- 
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ment must not be a two-dimensional array of dimensions (n x 1) or 
(1 x n). Thus, given the way in which the variable Y was created, (,Y) 
produces the appropriately dimensioned array. 

To test our function, let us consider some data from Kmenta (pages 653- 
65). The model, which we have used before, is 


OQ, = a, + a,P, + asD, + Ua (Demand Equation) 
QO, = b, + bP, + b3F, + b4A; + Us: (Supply Equation) 


Here Q, is the quantity of food consumed per head per year, P, is the ratio 
of food prices to general consumer prices, D, is disposable income, F, is the 
ratio of the previous year’s prices received by farmers to general prices, 
and A, is time in years. Q and P are regarded as endogenous; D, F, and A 
are exogenous. The demand equation is overidentified, and the supply 
equation is exactly identified. The data are given in Table 14.1. These data 
are simulated so that we Know the true model, which is 


QO, = 96.5 — 0.25 P, + 0.30 D, + Ux 
QO, = 62.5 + 0.15 P, + 0.20 F, + 0.36 A, + Uy 


Let us use our k-class function with k = 0 (gives OLS) and k = 1 (gives 
2SLS). In this demand equation, P, is Y, and i’ = (1, 1,..., 1), the D, are 
X,, and (F;,, A,) are X,. We begin by entering the arrays Q,, P;, D,, Fy, and A, 
into the computer. 

The output is self-explanatory. 


Table 14.1 Data List for Test of k-Class Estimators Routine* 


Y or OQ; 
(Dependent 
Variable) Y, or P, X, or D, X, or F; X, or A, X, ori 
98.485 100.323 87.4 98 1 1 
99.187 104.264 97.6 99.1 2 1 
102.163 103.435 96.7 99.1 3 1 
101.504 104.506 98.2 98.1 4 1 
104.24 98.001 99.8 110.8 5 1 
103.243 99.456 100.5 108.2 6 1 
103.993 101.066 103.2 105.6 7 1 
99.9 104.763 107.8 109.8 8 1 
100.35 96.446 96.6 108.7 9 1 
102.82 91.228 88.9 100.6 10 1 
95.435 93.085 75.1 81 11 1 
92.424 98.801 76.9 68.6 12 if 
94.535 102.908 84.6 70.9 13 1 
98.757 98.756 90.6 81.4 14 1 
105.797 95.119 103.1 102.3 15 1 
100.225 98.451 105.1 105 16 1 
103.522 86.498 96.4 110.5 17 1 
99.929 104.016 104.4 92.5 18 1 
105.223 105.769 110.7 89.3 19 1 
106.232 113.49 127.1 93 20 1 


« These data are stored for us (the authors) in an array called K 565. 
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First round—Ordinary Least-Squares Estimators (K = 0): 
KCLASSEST 
ENTER NO, OF ARRAYS OF VARIABLES 


6 
ENTER ARRAY NO. OF DEPENDENT VBL. 


1 
ENTER ARRAY NOS. OF ENDOGENOUS REGRESSOR VBLS. 
QO: 
2 
ENTER ARRAY NOS. OF EXOGENOUS INCL REGRESSOR VBLS. 


3 6 
ENTER ARRAY NOS. OF EXOGENOUS EXCL. REGRESSOR VBLS. 


WS 


ENTER DATA ARRAYS 6 IN NO. 


K565 
ENTER VALUE OF K FOR K-CLASS EST. 


0 


~0.3162988049 0.3346355982 99.89542291 


Second Round—2SLS Estimators (K = 1): 
KCLASSEST 
ENTER NO. OF ARRAYS OF VARTABLES 
O: 
6 
ENTER ARRAY NO. OF DEPENDENT VBL. 
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1 
ENTER ARRAY NOS. OF ENDOGENOUS REGRESSOR VBLS. 


2 
ENTER ARRAY NOS. OF EXOGENOUS INCL REGRESSOR VBLS. 


3 6 
ENTER ARRAY NOS. OF EXOGENOUS EXCL. REGRESSOR VBLS. 


45 


ENTER DATA ARRAYS 6 IN NO. 


K565 
ENTER VALUE OF K FOR K-CLASS EST. 


1 
~0.2435565378 0.3139917943 94.63330387 


Third Round—Limited Information Maximum Likelihood Estimators 


(K = 1.1739): 

KCLASSEST 
ENTER NO. OF ARRAYS OF VARIABLES 
(): 


6 
ENTER ARRAY NO. OF DEPENDENT VBL. 


1 
ENTER ARRAY NOS. OF ENDOGENOUS REGRESSOR VBLS. 
Oo: 
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ENTER ARRAY NOS. OF EXOGENOUS INCL REGRESSOR VBLS. 
EJs 
3 6 
ENTER ARRAY NOS. OF EXOGENOUS EXCL. REGRESSOR VBLS. 
O: 
h 5 
ENTER DATA ARRAYS 6 IN NO. 
O: 
K565 
ENTER VALUE OF K FOR K-CLASS EST. 
U: 


739 
“0. 2295353985 0.31001268214 93.61902556 


The value of k in the L.J.M.L. estimator is obtained by minimizing the 
ratio of the error sum of squares of the regression of (Y — Y,G,) on X, and 
on X; for an explanation, see Kmenta (page 569). 

We have now reached the end of the book. We trust that you have 
discovered by this stage the power (and beauty) of APL and, more impor- 
tantly, that in using APL you learned statistics more easily and fully than 
would otherwise have been possible. 

There is more to learn, in APL statistics and econometrics. To do 
that you will need to read some more books. As far as APL is concerned, 
you might now find it useful and interesting to look through some of the 
computer manuals recommended in the bibliography. The statistics and 
econometrics books listed are all excellent at their various and respective 
levels of sophistication and detail. 


This chapter discussed a number of APL routines for calculating 
regression estimators in some interesting extensions to the simple model. 
The first topic is that of simultaneous equations. The program to estimate 
the general A-class estimators was given. In addition, the 2SLS, indirect 
least squares, and instrumental variables estimators were discussed at 
length. 

The next topic to be discussed was Aitken’s generalized least-squares 
procedure, which is needed to estimate regression models where the 
covariance matrix of the disturbance terms is nonscalar. 

Another useful extension was to consider Durbin’s method for estimat- 
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ing a linear regression model, where the disturbance term was distributed 
according to a first order autoregressive process. 


Statistical Applications 


1. 


Write in APL the following formulas useful in simultaneous equation 
systems. 

(a) Z = QXaxn ®@ Ikxx, where @ is the mathematical Kronecker prod- 
uct. 


(b) B= [Z'(23@OX(X'X) OX )ZP'[SAQKX'X) AX YY 
(c) B= [ZLIQ@XX'X) LW IZ] OZ @OX(X'X) YY 
Let the true model be Y = 2X, + 3X. + U,, where 


X= LX, X2| = 


The U, are identically and independently distributed random vari- 
ables with mean zero and variance 1 (see the solution for a routine 
that generates independent random numbers from a normal distribu- 
tion with zero mean and variance equal to N(0, 1)). 

(a) Given X and U, find the values of Y. 


(b) Compute the variance of Y using the computed values of Y. 


Consider the following data: 


¥ X, X, 
49 35 55 
40 38 60 
46 40 70 
45 41 61 
52 42 68 
59 38 11 
55 31 59 
61 44 66 
64 38 75 
65 29 85 


where Y = the production, in thousands of bushels, of wheat of one 
farm over a period of 10 years; X, = mean January temperature; and 
X, = mean June temperature. 

Let the relationship between the variables be linear and of the form 
Y = a + A,X, + a,X_ + Uz 
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Test the following hypotheses: 


Ay: ag = 0 
against H,: a, #0 
and 


Ay: a = a4, = a =0 
against Hy: ao, a1, Qo #0 
Use a 5% significance level for your tests. 


4. Referring to exercise 3, suppose you believe that the June tempera- 
ture affects the production twice as much as does the January tem- 
perature, and you want to test if your belief is true. Mathematically, 
this can be solved by the following steps (given without proof). 


(a) Find @ = (X'X)"Xx’Y the unconstrained vector of a. 
(b) Find a = 4 + W(r — R&@) the restricted vector of a. 
where R=[0 2 -1] r=0 
W = (X'X)'R‘A 
A = [R(X’X) R'J' Notice Ra =r @ ay = 2a, 


(c) Find \} = AR(@ — @) and V)\ = o2A, the variance-covariance 
matrix of A, where o% is the variance of the residuals from the 
restricted regression.* Thus under Hy: a, = 2a, A ~ N(O, Vj), 
which is equivalent to testing 


Hig: A=0 
against H,: A #0 
(d) Another statistic that you might want to use is 
a@n-k-1 
ae Fyn-k-1 
where ui is the restricted error sum of squares, “'# is the 
unrestricted error sum of squares, and s is the number of 
constraints. 
(e) Yet another statistic that you might want to use is 


=> 


—2in == 
u 


a) 


which is distributed as chi-square with 5 degrees of freedom. 


(f) Do you get the same answers using the three different tests? 
That is, do you accept or reject your hypotheses in all three 
tests? 


5. Consider the following simple market model: 
g=2+pt+u gq’ =4-—ptov 


where qg* is quantity supplied, g¢ is quantity demanded, the equilib- 
rium condition is g* = q¢, and u~ N(O, 3) and v ~ N(O, 1), where uw 
and v are independent. 


* vis the Lagrange multiplier of the minimization problem L = u'u + 2(Ra~—r). 
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(a) Find the expected equilibrium price. 

(b) Suppose that the government initiates an inquiry whenever the 
equilibrium price is more than $1.50. Find the probability of hav- 
ing the government make an inquiry in this market. 


6. Refer to exercise 3. Suppose you believe that there is no constant 
term in that relationship, and you run the regression 


Y= 4,X, + aX, + U 


(a) Find the error sum of squares of this regression and compare it to 
error sum of squares of the regression in exercise 3. 

(b) Suppose that instead of running the previous regression, you 
decide to run 


Y=a,V + aX, + U* 


where V is the residual vector of the regression of X, on Xo, 
including a constant term. Use the same data to show that the 
estimate of a. is the same as that given by the regression of Y on 
X». 

7. Write a program that will calculate the following statistics in the 
heteroskedastic model, which is given by Y = XB + U, where the 
covariance matrix of U is %, which is diagonal with unequal elements 
on the diagonal. 

(a) B = (X'S"'X)"1X'=7'Y, the efficient estimator of the vector of 
regression coefficients, where = is a diagonal matrix whose ele- 
ments are known in advance. 

(b) The t-statistics for the B estimates. 


(c) Use the regression routine you have just written together with the 
data set WATT (Appendix E) to estimate the regression 


Y = a). + 4,X,; + aP, + U 


where Y is the K WHR of the 15 plants for the first year, X ts the BTU 
of the 15 plants for the first year, and P, is the vector of prices that the 
15 plants paid during the first year. = is a diagonal matrix with the 
numbers | to 15 on its diagonal; the numbers correspond to the size of 
the plant (1 for the smallest, 15 for the largest) measured by the total 
KWHR produced during the first year. 


8. Consider the following model that determines the equilibrium price 
and quantity of a product. 
dD, = do — a,P, + aeY, + Uy 
S, = bo + bP, + Ue; 
where D, = quantity demanded, endogenous; P, = price of product, 
endogenous; 5S, = quantity supplied, endogenous; Y., = disposable 


income, exogenous; D, = S,, the equilibrium condition. 
U,,, Us, are each identically and independently distributed normal 
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10. 


11. 


12. 


variables with means zero, variances o? and a3, respectively, and 

covariance op. 

(a) Solve for the reduced form. 

(b) Is OLS an appropriate method of estimation for the reduced form 
coefficients? 


(c) Generate by computer a sample of observations on U,,, U,, and Y, 
for 20 observations and compute values for D,, S,, and P, by 
solving the equations after picking values for the coefficients. Use 
your generated data to run OLS and 2SLS regressions on both 
equations and comment on your results. 


Consider the very simple income determination model: 
C; = ag + ary, + U, 
Y, > C; + L 
where C; = aggregate consumption, J, = Investment (exogenous), 
Y, = GNP, U, = stochastic disturbance. 
Use the data set MACRO in Appendix E. 
(a) Estimate a, and a, using OLS on equation (1). 
(b) Write the reduced form equation for C, (using equations 1 & 2). 
(c) Estimate a) and a, using OLS on the reduced form equation for C. 


Consider the following product market model for the United States. 
Use the data set MACRO. See Appendix E. Can you estimate these 
equations? 


C; = ayy, + agRLi_1 + Un 
I, = bY, + BRL, + Un 
Check to see which, if any, of these equations are identified. 


Using the money supply series in data set MACRO, estimate the 
parameters of the model: 


ML, = ay) + Q\ML,_, + U, 
A partial adjustment hypothesis could state that the target level of per 
capita consumption (C;) is a linear function of income, i.e.: 
C# =a + bY, 
Actual consumption is adjusted towards C* according to 
Ci — Cia = viCF -— Cra] + U,0<y <1 


which means that people adjust their consumption towards the de- 
sired level. However, the adjustment is not generally accomplished in 
one period (y < 1). Use the data in MACRO from 1950 to 1977 in 
Appendix FE to test this hypothesis. On the basis of your estimated 
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13. 


model predict consumption for the year 1978 and check your predic- 
tion with the actual 1978 value of consumption. 


Suppose that you want to test Milton Friedman’s permanent income 
hypothesis. You first want to translate his hypothesis into mathe- 
matical formulas and test it on real data. The entire model might be 
summarized in the following two equations: 


C; at bYp; 
Yp, = yrizo(l — y)' Yi, 


These equations can be combined to yield the following final equa- 
tion: 


C, = ay + byY, + (1 - ys + U; 


Verify that this final equation is identical to the model in exercise 12 
after substitution of C/ and reinterpret the estimated coefficients. 


Appendix A 


The Computer: Where It 
Is and How to Get 
Access to It 


A.1 Account Number and Password 


Account No. and 
Password 
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In this brief appendix you will learn how to ‘‘log-on’’ to the computer. 
First, let’s repeat that every computer center has its own administrative 
procedures for determining who can use the machine and what resources 
are allocated to those users. Some organizations make these arrangements 
for you, otherwise you go to the computer center to do it yourself. In any 
event, it’s nothing to be overly concerned about. Typically, a form is 
completed and you are given an ‘‘account number’’ and a ‘‘password,”’ 
and sometimes a budget or time constraint. The account numbers are 
usually six or fewer digits, and the passwords are eight or fewer characters 
(either digits or letters). As an example, at Stanford the ‘‘account num- 
bers’’ have two parts: a user number-J66 for the project, and a user 
group-E]. The password was H@V. (To discourage poachers, a slang term 
for people who specialize in using other people’s accounts for their own 
purposes—such as playing space war—one should keep his or her 
password confidential.) 

With this administrative chore done, you are ready to begin. First find an 
unoccupied computer terminal. The keyboard looks like any electric 
typewriter, and in fact the letters are in the same position. The end of this 
appendix contains diagrams of some typical keyboards—one for an IBM 
5120 minicomputer, an IBM-2741 terminal and a DEC-Writer H, and a 
Hewlett-Packard-HP2541A—literally hundreds of others exist, although 
most have the standard keyboard layout and features. In other respects, 
each terminal is slightly different; they hide the on-off switch in different 
places, provide extra buttons, extra keys, lights, and even electronic whis- 
tles. None of these matters should worry you, but you must be sure that 
the terminal has the special APL characters. If your terminal does not have 
these special symbols, it is probably called an ASCII (American Society 
for Coded Information Interchange) terminal. These terminals do not sup- 
port the special character set of APL; however, all is not lost. The APL 
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language allows you to use mnemonic codes in place of the special sym- 
bols. Look at the letter R; above it is the Greek letter rho, p. To represent 
p you would type the mnemonic code .RO and the computer would under- 
stand it to be the symbol p. We have included a list of mnemonic codes 
used in APL in Appendix C. These vary somewhat from computer to 
computer, and you should check with your facility if you must use one of 
these ASCII terminals. You are now ready to start, and we presume that 
you are sitting in front of the terminal. 


A.2 Log-on Procedure 


Log-on Procedures 


On the IBM-5120 minicomputer there is a white switch on the front control 
panel marked APL/BASIC; push it to the APL position and turn the red 
power (on-off) switch on. That is all there is to it. When the machine warms 
up, the TV screen will say CLEAR WS, meaning that you are ready to go 
with a clear workspace—the electronic equivalent of a blank piece of 
paper. If you have a 5120 or similar microcomputer, you can skip to the 
log-off section of this appendix. 

Those of you who are operating in a time-sharing environment will have 
to use a more complicated log-on procedure. On the 5120, which is a small 
desktop minicomputer, there is one user. On a big computer, hundreds of 
users are connected to the computer. These connections are done in one of 
two ways: via a direct wire from your terminal to the computer, or over the 
telephone system. If you see a telephone by every terminal, you know how 
your terminal is connected. 

Unfortunately the log-on procedure varies from computer center to 
computer center. However, the procedures are usually not difficult to ac- 
quire. We suggest that if you are not provided with a log-on procedure 
sheet by the computer center, you make one up yourself until you have 
everything memorized. Believe us, one of the most frustrating parts of 
programming is learning how to get programming access to the computer; 
once that is done, the rest is plain sailing. The problem here is that, as we 
mentioned above, every center has its own ideas about how people should 
log-on. 

Nevertheless, to give you an idea as to what is involved, consider the 
following procedure. Starting with the direct wire situation, you turn on 
the terminal and type the symbol ), then your account number and 
password, and then press the “‘return’’ key. This button sends one line of 
information to the computer. This could look like 


)1984:ME 


your account being number 1984 and your password ME. This password is 
sometimes called a ‘‘LOCK.”’ The computer will respond with something 
like: 


062* 10:41:32 9/10/80 Von Mises 
OPR: SYSTEM AVAILABLE TO 22:30 
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CLEAR WS 


Translating this into English, it means that you are using telephone line 
number 62 (called a ‘‘port’’ by computernics), the time is 10:41 and 32 
seconds A.M. on the tenth day of September, 1980, and the user’s name is 
Von Mises. The operator indicates that the system will be available until 
10:30 tonight. 

If, when you turned on your terminal, you could not type on it, or when 
you typed nothing happened, it might be that the terminal is in local mode. 
Look for a switch marked COM/LOC for communication and local. It 
needs to be in the COM mode to communicate with the computer. Clear 
Work Space means you are ready to begin. 

If you connect to the computer via a telephone line, you have exactly the 
same log-on procedure, with the additional task of calling the computer. 
The procedure is simple enough; first you activate a small box called a 
modem. Modem stands for Modulator Demodulator. This modem essen- 
tially takes the signals from your terminal and sends them over the tele- 
phone line, and then retranslates the signals from the computer to your 
terminal. 

These modems are of two basic types: one a Data-Phone and the other 
an acoustic coupler. The Data-Phone is usually a Western Electric device; 
you simply pick up the telephone receiver, press the ‘‘talk’’ button, and 
dial the computer’s telephone number after you get a dial tone. When the 
computer electronically answers the phone, you will hear a high pitched 
tone that indicates that the computer is ready for you. You then press the 
“‘data’’ button. 

The computer will typically send a brief message or just one 
character—a — or a *. You can now ‘“‘log-on’’ exactly as the direct-wired 
terminal user does. 

The final connecting device is the acoustic coupler. Again the terminal 
power is turned on. With this equipment, the electric power to the modem 
is also turned on, and the telephone number is called. When you hear the 
high pitched tone, push the telephone hand set into the coupler receptacle. 
Sometimes there is an indication of which direction to place the hand set, 
but if there isn’t, just take a guess. If it doesn’t work, try it the other way 
around. Again, you will get some message from the computer and you 
log-on. 

As an example, here is how it would be done at the Stanford Computer 
Facility. Our connection is hard wired, so we turn on the terminal (a 
GenCom, with the switch cleverly placed under the keyboard so that it is 
almost impossible to find). I type an ‘‘a’’ and the computer responds with a 
+, [type SCF APL and the machine responds. . . SCF 168. . . waits a 
moment and then types User?. I type J66; it responds, Group? I tell it El; it 
asks, Password? I reveal ZV (since changed) and it asks, Command?. I 
respond CALL APL; the machine types APL, then skips a line and types 
CLEAR WS. 

As another example, consider the sign-on procedure used at New York 
University when using the CUNY computer center. At this point we run 
into another option on terminals—some can be used for both APL and 
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other languages. The character set for other languages is often designated 
STD CHAR SET (Standard Character Set), and you should be in standard 
to begin. The procedure is: 


Switch on power of terminal and modem; 
Ensure character switch is set to STD; 


Dial up and place phone in acoustic coupler when high pitched tone 
comes on line (a white light on the acoustic coupler comes on if the 
connection is good); 


Type ‘‘shift’’ P 

Hit RETURN key (sometimes called EXECUTE); 

System responds: type: a for apl, w for wylbur, o for callos 

Type: a 

Hit RETURN key; 

System responds: PROJECT No., ID? 

Type: [the project number given to you by the computer center]; 

System responds: PASSWORD? 

Type: [the password you have picked and which is on file with the 
computer center]; 

Change character switch from STD to ALT; 

Type: )BLot [the ‘‘)’’ symbol is upper case] key on the APL keyboard 

(see end of this appendix)] 

System responds: XXXXXXXXKXXKXXXXKXXKXXXXKX 

Type: ) followed by APL account number, the symbol : and password; 

System responds: 


Date Your Name [as filed in computer center] 
CUNY APL 


Your log-on is now complete, and you are ready to begin. 

By now you realize that every center is different and will have its own 
special way of logging you onto the machine. However, you now know 
enough to understand the general idea of the log-on procedures used in any 
computer center. They all have handouts that explain the details and pecu- 
liarities of their own system. 


A.3 Log-off Procedure 


Logging off is simple. On the microcomputer systems you just turn the 
machine off. Of course, everything in the machine will be erased if you do 
not copy it onto a tape or disk. The same thing is true when you use the 
terminals. You must make special provision to save anything you want to 
use again. You learned how to do this in Chapter 7. If you have a 5120 or 
other microcomputer, you can skip to the last sentence of this appendix. 
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Log-off Procedure 


IBM 5120 desktop 
computer showing the 
APL character set, 
numeric pad, and 
special function keys. 


IBM 5120 showing 
keyboard characters 
that can be entered 
using the command key 
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To log-off on the terminal, you type )@FF and press the return button. 
The machine will respond, indicating that it understood your command. 
The response is something like: 


062* 13:01:04 9/10/80 Von Mises 
Started 10:41:32 Clocktime this session 2:22:32 to date 17:25:30 
CPUTime this session 0:00:10 to date 00:01:10 


The first line here is similar to the first line of your terminal session; it 
indicates your port, the current time (1:04 P.M.), the date, and your name. 
The second line repeats the time you started and computes the elapsed 


Gerald Musgrave 


This top photograph showing a 5120 desktop computer can be programmed 
in either Basic or APL with the flip of a switch. The keyboard is exactly like 
a standard typewriter in that pressing the shift key (either of the keys with 
the wide arrows on the bottom rank of keys) results in the APL characters 
being entered into the computer. A convenient feature is that by holding the 
command key (CMD, on the far left) and pressing one of the keys on the top 
row will produce an entire command. For example, holding down CMD 

and pressing | results in the command) LOAD being entered automatically. 


Gerald Musgrave 


The lower photograph shows the special overstruck characters that can be 
produced with one stroke. The command is held down and any of the 
individual keys now represents a new symbol or combination of key strokes. 
For example, pressing the CMD key and the F key results in the divide quad 
or domino function being entered. If the machine were in the Basic 
programming mode the characters input would have been entered. Using the 
CMD key saves a number of key strokes and is a handy feature. 


Operator uses Hewlett- 
Packard 2641A CRT 
terminal specifically 
designed for APL and 
featuring extensive 
communications 
capabilities. 


Closeup of Hewlett- 
Packard 2641A terminal 
keyboard showing the 
various APL characters 
and special function 
keys. 
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The operator 1s using the H-P 2641A terminal as a device to communicate 
with a central computer. In the upper right corner above the keyboard are 
two tape cassettes for data. 


The closeup photo of the Hewlett-Packard 2641A CRT terminal keyboard 
shows the APL characters, numeric pad, cursor control, and tape control 
keys. The dial in the upper left corner allows the operator to select the 
communications rate from 110 to 9600 band. The higher rates are used for 
direct or hardwired communications to the host computer and the lower 
rates are generally used when communication is over telephone lines. 


time that you were connected to the machine for this session and all your 
previous sessions. The third line tells how much computer time you have 
used. 

The final steps are to disconnect the terminal from the computer and 
turn off the electricity to the equipment. If your terminal is hard wired or 
connected via a DataPhone, just turn the electricity off. If the terminal is 


IBM 5120 showing the 
full keyboard and dual 
diskette slots. 
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connected via an acoustic coupler, you turn off the electricity to the termi- 
nal, hang up the phone, and turn off the power to the coupler. 

These log-off procedures also vary from installation to installation. At 
Stanford, to log-off you would type: )OFF , and press the return button. 
(Remember—the computer does not get your message until you send it by 
pressing the return button.) The computer responds: Command? Next, the 
user types ‘‘logoff,’’ and the computer responds with a large amount of 
information including the CPU time, elapsed time, the dollar charges to 
your account, and the words END OF SESSION. 


| @ 


Gerald Musgrave 


The IBM has the full APL character set and two diskette drives. The 
diskettes are used to store APL functions and data. 


Pictured is the MCM 
System 900 desktop 
APL computer, 
manufactured by MCM 
Computers Ltd. of 
Kingston, Ontario, 
Canada and distributed 
in the United States by 
Interactive Computer 
Systems, Inc., New 
York, New York. 
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The MCM 900 is a desktop computer that can be programmed in APL as a 
standalone device. It can also be used as a communications terminal to 
other computers using its data communication interface. A number of 
applications routines are available to assist the APL programmer in the 
areas of business management, pension administration, bilingual word 
processing, and financial accounting. 
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Longley Benchmark 


The ‘*Longley Benchmark,’’ as it has come to be called, was developed in 
a paper titled ‘“‘An Appraisal of the Least Squares Programs for the Elec- 
tronic Computer from the Point of View of the User.’’' The results were 
shocking to some since a number of well-known and extensively used 
programs produced inaccurate results. In fact some of the computed coef- 
ficients have the wrong sign! Beaton and Barone used the same data in 


OLS Regression Equation 
Y= By + BX, + BL X + B,X; + BX, ae B;X; + By Xo + u 


Regres- 
sand Regressor Variables 
xX, 
Y GNP Xs 
Total Implicit X, X; Noninstitutional 
Derived Price Gross X3 Size of Population 14 
Employ- Deflator National Unem- Armed Years of Age Xe 
ment 1954 = 100 Product ployment Forces and Over Time 
60,323 83.0 234,289 2,356 1,590 107,608 1947 
61,122 88.5 259,426 2,325 1,456 108,632 1948 
60,171 88.2 258,054 3,682 1,616 109,773 1949 
61,187 89.5 284,599 3,351 1,650 110,929 1950 
63,221 96.2 328,975 2,099 3,099 112,075 1951 
63,639 98.1 346,999 1,932 3,594 113,270 1952 
64,989 99.0 365,385 1,870 3,547 115,094 1953 
63,761 100.0 363,112 3,578 3,350 116,219 1954 
66,019 101.2 397 ,469 2,904 3,048 117,388 1955 
67,857 104.6 419,180 2,822 2,857 118,734 1956 
68,169 108.4 442,769 2,936 2,798 120,445 1957 
66,513 110.8 444,546 4,681 2,637 121,950 1958 
68,655 112.6 482,704 3,813 2,552 123,366 1959 
69,564 114.2 502,601 3,931 2,514 125,368 1960 
69,331 115.7 518,173 4,806 2,572 127,852 1961 
70,551 116.9 554,894 4,007 2,827 130,081 1962 


' Longley, James W., American Statistical Association Journal, Vol. 62, No. 31, 7 September, 1967, 
pp. 819-841. : 
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other programs, and they found similar inaccuracies.” As a special check, 
they wrote their own routine, using multiple precision arithmetic and were 
careful about the numerical analysis methods they used. Their results were 
in agreement with Longley’s calculations. So we have some confidence in 
the computed accuracy of the benchmark. We should add parenthetically 
that Beaton and Barone caution researchers on interpreting regression re- 
sults in problems of this nature. However, our point is that, ceterus 
paribus, more accuracy should be preferred to less. 

We have included the raw data and some of the published results, and 
the results from our APL runs with the & function. 


Computed Coefficients 


Longley 
Longley Computation 
Computation Using High 
Using Desk Precision 
Calculator Program 360 APL-STAT 
By — 3482258.6330 — 3482258.634597 — 3,482,258 .634594932 
B, 15.061872271 15.06187227161 15.06187227 114229 
B, —0.035819179 —0.0358 19179293 — 0.03581917929260994 
B; —2.020229803 —2.020229803818 — 2.0202298028 17922 
B, — 1.033226867 — 1.033226867 174 — 1.033226867 173836 
B, —0.051104105 ~0.051104105653 —0.05110410565236916 
Bs 1829.15146461 1829.151461614112 1829.151464613044 
Standard Error of Regression Coefficient 
Longley APL-STAT 
Xo 890420.3836 890420.3862 191836 
Xi, 84.9149 84.91492578619182 
X, 0.0335 0.03349100783754301 
Xs 0.4884 0.4883996826502304 
X, 0.2143 0.2142741633587326 
Xs 0.2261 0.226073200 1756643 
X6 455.4785 455.4785004777639 
Computed ¢-Values 
Longley APL-STAT 
to —3.9108 ~3.91080290668244 
t 0.1774 0.1773760282034131 
ty — 1.0695 — 1.0695163151363 
ty —4,1364 —4.136427347486052 
ty —4,8220 — 7,82 1985306012059 
ts —0.2261 ~0.2260511445525611 
te 4.0159 4.015889800933297 


This should give you an idea of our results compared to the benchmark. 
You might want to try this problem on your computer with the APL-STAT 
routines and also with other program packages. 

2 Beaton, A., Rubin, D., and Barone, J., ‘The Acceptability of Regression Solutions: Another Look at 


Computational Accuracy,”’ Journal of the American Statistical Association, Vol. 71, No. 353, March, 
1976, pp. 158-168. 
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APL Character Set 


Single-Strike Characters 


APL Set ASCII Set Mnemonic Name 


+ + add 
A-Z A-Z alphabetics 
A & .AN ANd 
+ <— or _ assignment 
concatenate, comma 
colon 
decimal point 
.DV DiVide 
equal to 
expand 
exponentiate 
.GT Greater Than 
left bracket 
left parenthesis 
.LT Less Than 
multiply 
numerics 
quote string 
question (roll and deal) 
reduce 
right bracket 
right parenthesis 
semicolon 
subtract 
-TK TaKe 
.US UnderScore 
.AB residue (ABsolute value) 
AL ALpha 
.BX quad (BoX) 
.CE CEiling (maximum) 


1 KX ASIN FEW oles 


oO 


oO 
Sleeves s -Pe ancy esi: 


7Oe-|[ erie eeny - 


Appendix 


Overstruck Characters 


305 


APL Set ASCII Set Mnemonics Name 
b .RV ReVersal 
& .TR TRanspose 
$ .XQ eXecute 
P .FM ForMat 
A-Z .ZA-.ZZ, underscored alphabetics 
A L@ underscored lower del 
Single-Strike Characters 
APL Set ASCII Set Mnemonic Name 
+ .DA drop (Down Arrow) 
*e .DD Dieresis 
i .DE DEcode 
V $ .DL DeL 
4 .DM DiaMond 
n .DU Down Union 
T .EN ENcode 
€ .EP EPsilon 
L .FL FLoor 
2 .GE Greater than or Equal 
> .GO GO to (branch) 
1 AO 10ta 
{ .LB Left Brace 
A .LD delta (Lower Del) 
< .LE Less than or Equal 
b -LK Left tack 
fe) .LO circle (Large O) 
> -LU Left Union 
# .NE Not Equal to 
.NG NeGation 
~ .NT NoT 
w .OM OMega 
Vv .OR OR 
} .RB Right Brace 
4 -RK Right tack 
a) .RO RhO 
C RU Right Union 
i) SO jot (Small o) 
U .UU Up Union 
Overstruck Characters 
APL Set ASCII Set Mnemonic Name 
A a LAmp (Comment) 
. ! factorial 
$ $ .DS dollar sign 
V .GD Grade Down 
4 .GU Grade Up 
r .IB I-Beam 
@ .LG LoGarithm 
nN .NN NaNd 
sa .NR NoR 
x .CB Column expansion 
7] .CR Column Rotate 
Z .CS Column Reduction 
EF} .DQ Divide Quad 
re) .OU out 
¥ .PD Protected Del 
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Overstruck Characters 


Comment 


Compress* 


Execute 


Expand* 


Factorial, Combination 


Format 


Grade Down 


Grade Up 


Logarithm 


Matrix Division 


Protected Function 


Quad Quote 


Rotate, Reverse 


Rotate, Reverse* 


Transpose 


CHARACTER 


ial 
c 
\ 
/ 
1 
B 
\ 
/ 
2 
N 
x 
P 
QO 
L 
A 
0 
v 
9 
QO 
L 
\ 
/ 


eG tS ea aC a) ec 


* These are variations of the symbols for these functions. These variations are 


used when the function is to act on the first coordinate of an array. 
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Saving Your Workspace 
on The IBM 5110 
Microcomputer 


If you want to save your workspace and you are using the 5120, you will 
use the cartridge tape. You must first ‘‘mark’’ the number of tape files on 
the tape and then transfer the workspace onto the tape. The details of 
marking the tape are not directly relevant for this discussion. However, if 


you type 
)MARK 32 2 1 


you will mark 2 tape files. The numbering of the files begins with one, and 
each file contains 32 blocks of 1024 Bytes. There is enough room to store the 
whole memory of a machine with 32,768 Bytes of core. The computer will 
respond with 


MARKED 0002 0032 
To copy the workspace onto the tape, key in 
)CONTINUE 1001 PROB 


The first 1 indicates tape drive 1 (on the machine), 001 means you are 
storing the information of the first file and you are naming it PROB. The 
5110 will respond 


CONTINUED 1001 PROB 


if all goes well. You could then take your tape out, turn the machine off and 
walk away with the tape. Next time, after turning the machine on and 
obtaining the CLEAR WS signal, you would push the tape cartridge into the 
machine and key 


)ZOAD 1001 PROB 
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The computer responds 
LOADED 1001 PROB 


and you are ready to go. 

The )SAVE , )COPY , and )PCOPY commands are used in much the same 
way as they were described in the previous section. If you skipped that 
section, you should read it now. The main differences concern marking the 
tape as you just learned, keeping track of the physical record number of 
the workspace, and the fact that continued workspaces are not automat- 
ically loaded when the computer is turned on. 

If you forget the physical location (file number) of your saved work- 
spaces, you can issue the )Z7B command. The computer will respond with 
the file number and the name of the workspace. It also displays other 
information about the file—but that does not concern us now. Two warn- 
ings about this command are appropriate. Firstly, the computer does not 
automatically rewind the tape to its beginning. You should issue the )RZ- 
WIND command to accomplish this. Secondly, a small slot, marked with 
an arrow, exists on the top of the cartridge (see the figure). You must turn 
the little wheel (a dime in the slot is perfect) until the arrow points away 
from the word ‘‘safe,’’ as in the figure. If the arrow points to ‘‘safe’’ the 
computer will not write on the tape, although you can read from the tape at 
all times. 

On other mini- and microcomputers you may save your functions and 
variables on floppy disks. This is also true for the IBM 5110 and 5120 series 
machines. The procedure for disks is similar to the one we outlined for 


tapes. 
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Data Set ‘Macro’ 


A. GNP and its components 


1. GNP Billions of 1972 US $ Y 
2. Consumption Billions of 1972 US $ Cc 
a) Nondurables Billions of 1972 US $ CN 
b) Durables Billions of 1972 US $ CD 
3. Investment Billions of 1972 US $ I 
4. Government Billions of 1972 US $ G 
5. Imports Billions of 1972 US $ IM 
6. Exports Billions of 1972 US $ xX 
B. Income 
1. Disposable Income Billions of 1972 US $ YD 
C. Prices, Wages, Interest Rates 
1. GNP Deflator =100 in 1972 P 
2. Short-term interest rates % per annum RS 
3. Long-term interest rates % per annum RL 
4. Wage rate in current US $’s Ww 
per week 


D. Employment 


1. Employment Thousands of persons L 

2. Unemployment % U 
E. Assets 

1. Capital Stock Billions of US $ K 
F. Money Supply Billions of US $ ML 


Source: Economic report of the President, January 1979. Series only from 1950 to 1978. 
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Units of Measurement Abbreviations 


Year Y Cc I G x IM YD 
1950 533.5 338.1 93.7 97.7 21.7 17.7 205.5 
1951 576.5 342.3 94.1 132.7 25.9 18.5 224.8 
1952 598.5 350.9 83.2 159.5 24.9 20.0 236.4 
1953 621.8 364.2 85.6 170.0 23.8 21.8 250.7 
1954 613.7 370.9 83.4 154.9 25.3 20.8 255.7 
1955 654.8 395.1 104.1 150.9 27.9 23.2 273.4 
1956 668.8 406.3 102.9 152.4 32.3 25.0 291.3 
1957 680.9 414.7 97.2 160.1 34.8 26.0 306.9 
1958 679.5 419.0 87.7 169.3 30.7 27.2 317.1 
1959 720.4 441.5 107.4 170.7 31.5 30.6 336.1 


Appendix 


Data Set Macro (Continued) 


311 


Year Y Cc I G x IM YD 
1960 736.8 453.0 105.4 172.9 35.8 30.3 349.4 
1961 755.3 462.2 103.6 182.8 37.0 30.3 362.9 
1962 799.1 482.9 117.4 193.1 39.6 33.9 383.9 
1963 830.7 501.4 124.5 197.6 42.2 35.0 402.8 
1964 874.4 528.7 132.1 202.7 47.8 36.9 437.0 
1965 925.9 558.1 150.1 209.6 49.1 41.0 472.2 
1966 981.0 586.1 161.3 229.3 51.6 47.3 510.4 
1967 1,007.7 603.2 152.7 248.3 54.2 50.7 544.5 
1968 1,051.8 633.4 159.5 259.2 58.5 58.9 588.1 
1969 1,078.8 655.4 168.0 256.7 62.2 63.5 630.4 
1970 1,075.3 668.9 154.7 250.2 67.1 65.7 685.9 
1971 1,107.5 691.9 166.8 249.4 67.9 68.5 742.8 
1972 1,171.1 733.0 188.3 253.1 72.7 75.9 801.3 
1973 1,235.0 767.7 207.2 252.5 87.4 79.9 901.7 
1974 1,217.8 760.7 183.6 257.7 93.0 TWA 984.6 
1975 1,202.3 774.6 142.6 262.6 90.0 67.5 1,086.7 
1976 1,271.0 819.4 173.4 262.8 95.9 80.5 1,184.4 
1977 1,332.7 857.7 196.3 269.2 98.2 88.7 1,303.0 
1978 1,385.1 891.2 210.1 275.2 107.3 98.7 1,451.2 
Data Set Macro (Continued) 

Year P RL RS WwW L 
1950 53.64 2.62 1.45 53.13 58,918 
1951 57.27 2.86 2.16 57.86 59,961 
1952 58.00 2.96 2.33 60.65 60,250 
1953 58.88 3.20 2.52 63.76 61,179 
1954 59.69 2.90 1.58 64.52 60,109 
1955 60.98 3.06 2.18 67.72 62,170 
1956 62.90 3.36 3.31 70.74 63,799 
1957 65.02 3.89 3.81 73.33 64,071 
1958 66.06 3.79 2.46 75.08 63,036 
1959 67.52 4.38 3.97 78.78 64,630 
1960 68.67 4.41 3.85 80.67 65,778 
1961 69.28 4.35 2.97 82.60 65,746 
1962 70.55 4.33 3.26 85.91 66,702 
1963 71.59 4.26 3,55 88.46 67,762 
1964 72.71 4.40 3.97 91.33 69,305 
1965 74.32 4.49 4.38 95.45 71,088 
1966 76.76 5.13 5.55 98.82 72,895 
1967 79.02 5.51 5.10 101.84 74,372 
1968 82.57 6.18 5.90 107.73 75,920 
1969 86.72 7.03 7.83 114.61 77,902 
1970 91.36 8.04 7.72 119.83 78,627 
1971 96.02 7.39 5.11 127.31 79,120 
1972 100.00 7.21 4.69 136.90 81,702 
1973 105.80 7.44 8.15 145.39 84,409 
1974 116.02 8.57 9.87 154.76 85,935 
1975 127.15 8.83 6.33 163.53 84,783 
1976 133.76 8.43 5.35 175.45 87,485 
1977 141.61 8.02 5.60 188.64 90,546 
1978 152.09 8.73 7.99 203.34 94,373 
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Data Set Macro (Continued) 


Year U ML K 
1950 5.3 0 0 
1951 3.3 0 0 
1952 3.0 0 0 
1953 2.9 128.8 0 
1954 5.5 132.3 0 
1955 4.4 135.2 0 
1956 4.1 136.9 0 
1957 4.3 135.9 0 
1958 6.8 141.1 0 
1959 5.5 143.4 0 
1960 5.5 144.2 0 
1961 6.7 148.7 254.7 
1962 5.5 150.9 269.7 
1963 5.7 156.5 288.2 
1964 5.2 163.7 305.6 
1965 4.5 171.4 336.0 
1966 3.8 175.8 364.0 
1967 3.8 187.4 386.2 
1968 3.6 202.5 426.5 
1969 3.5 209.0 473.6 
1970 4.9 219.7 492.3 
1971 5.9 234.0 529.6 
1972 5.6 255.3 599.3 
1973 4.9 270.5 697.8 
1974 5.6 282.9 734.6 
1975 8.5 295.2 756.3 
1976 V7 313.5 823.1 
1977 7.0 338.5 900.1 
1978 6.0 361.1 953.6 
° Not available 
Data Set Watt* 
List of the Plants 

Number Plant Name 

1 J. H. Cambell 

2 B. C. Cobb 

3 Dan E. Kern 

4 J. R. Whiting 

5 Rock River (Wis.) 

6 Nelson Dewey (Wis.) 

7 Minn. Power and Light Co., Clay Boswell 

8 St. Clair 

9 Presque Isle 

10 Gulf Power (Scholtz) 

11 Gulf Power (Lansing Smith) 

12 Tampa Electric Company (F. J. Gannon) 

13 Northern States Power Co. (Lawrence) 

14 Montana Dakota Utilities Co. (Heshett) 

15 Appalachian Power Co., Clinch River 


Source: U.S. Federal Power Commission, Steam Electric Plant Construction Cost and Annual Pro- 
duction Expenses 1966-1977. 
U.S. Energy Information Administration 1974. 
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Data Set Watt 


The data set WATT consists of three tables, the KWHR produced in 
Table 1, the BTU’s used in Table 2, and the prices of BTU’s in Table 3. The 
observations are for 15 plants over a period of 10 years. Thus, WATT is a3 
by 10 by 16 (including the year column) three-dimensional array. 


Table 1 Number of KWHR in millions 
Plant Number 


Year 1 2 3 4 5 
1966 1,938.4 2,754.9 4,066.6 2,575.7 927.7 
1967 2,980.4 2,752.4 3,878.5 2,425.5 977.8 
1968 3,904.3 2,923.6 4,055.3 2,442.2 971.7 
1969 3,678 2,907.9 3,900.7 2,393.9 987.9 
1970 4,273.9 2,901.9 3,590.3 2,276.2 926.5 
1971 3,316.7 3,244.4 3,558.2 2,190.1 807.5 
1972 3,971.3 2,874.3 3,489.9 1,921.2 827.8 
1973 4,103.7 3,449.8 3,438.2 2,211.8 862.4 
1974 3,921.4 3,506.8 3,328.9 2,440.5 727.5 
1975 3,460.9 2,869.6 3,032.5 2,155.4 546.9 
Year 6 7 8 9 10 
1966 1,393 892.2 9,792.8 854.2 246.0 
1967 1,453.1 1,046.5 9,468.6 1,114.6 233.6 
1968 1,505.9 1,023.2 9,373.4 1,135.4 313.4 
1969 1,454.9 941.2 11,671.6 1,229.6 415.4 
1970 1,275.0 947.9 12,620.4 1,161.4 389.4 
1971 1,286.1 916.5 12,606.2 1,086.8 430.9 
1972 1,439.8 945.6 12,386.8 1,121.0 433.3 
1973 1,371.4 2,007.4 10,509.6 1,136.4 505.9 
1974 1,126.5 2,773.6 9,958.2 1,130.8 462.2 
1975 1,186.9 3,192.1 8,266.0 1,795.7 489 
Year 11 12 13 14 15 
1966 1,006.8 4,481.7 128 355.3 5,660.3 
1967 1,648.4 4,666.5 176.2 462.5 5,345.6 
1968 2,231.9 5,078.1 144.8 510.7 5,191.0 
1969 2,173.2 4,954.9 113.2 497.7 5,353.9 
1970 1,961.5 5,350.0 124.5 564.8 5,223.7 
1971 1,754.6 4,736.7 133.3 583.2 5,052.4 
1972 1,827.5 5,136.3 167.7 612.9 5,484.9 
1973 2,043.8 4,880.3 168.1 575.9 5,575.2 
1974 1,722.8 3,821.6 150.3 610.8 4,974.5 
1975 1,592.3 4,392.5 40.8 602.9 4,202.2 
Table 2. Number of BTU’s in 10'? Units 
Plant Number 
Year l 2 3 4 5 
1966 663.2 1,049.4 1,376.2 986.4 402.9 
1967 1,093.6 1,066.4 1,331.9 942.3 413.9 
1968 1,429.9 1,133.6 1,426.6 955 425.7 
1969 1,349.5 1,138.9 1,384.6 955.2 440.5 
1970 1,569.1 1,200.4 1,308.0 922 431.3 
1971 1,237.8 1,363.0 1,302.6 913.1 378.1 
1972 1,497.0 1,190.3 1,289.7 790.7 389.1 
1973 1,540.4 1,424.3 1,282.9 831.3 414.2 
1974 1,488.7 1,474.0 1,258.1 972 362.6 
1975 1,355.4 1,238.1 1,128.8 838.4 291.4 
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Table 2 (Continued) 


Plant Number 


Year 6 7 8 9 10 
1966 601 397.8 3,765.6 362.5 118.6 
1967 625.6 426.9 3,633.5 454.2 110.2 
1968 648.9 411.6 3,592.5 465.9 144.5 
1969 631.6 485.5 4,483.7 507.6 192.5 
1970 560.4 583.0 4,797.9 487.7 186 
1971 565.4 570.2 4,788.3 476.8 218.4 
1972 638.1 605.8 4,508.5 511.5 224.3 
1973 605.2 1,143.4 3,560.2 516.4 241.6 
1974 507.2 1,707.6 473.9 549.4 225 
1975 600.8 2,063.0 2,434.0 794.3 235.5 

Plant Number 
Year 11 12 13 14 15 
1966 422.1 1,965.5 18.3 330.2 2,069.5 
1967 699.6 2,113.3 28.1 425.1 1,964.8 
1968 941.5 2,340.2 16 467.3 1,912.5 
1969 930.1 2,285.8 26.5 463.2 1,990.1 
1970 868 2,466.3 31.8 §20.2 1,972.0 
1971 770.9 2,213.4 46.9 541.9 1,917.9 
1972 849.0 2,420.6 52.6 571.3 2,068.5 
1973 893.3 2,246.3 47.7 $42.3 2,145.4 
1974 771.3 1,808.1 59.6 575.9 1,948.6 
1975 716.2 1,039.9 32.5 577.3 1,609.2 
Table 3 Cost per Million BTU in ¢ 

Plant Number 
Year 1 2 3 4 5 
1966 30.40 29.40 30.7 27.20 31.79 
1967 27.30 29.90 31.80 27.70 32.38 
1968 27.60 31.70 31.70 28.40 33.36 
1969 29.00 34.00 31.80 30.10 34.94 
1970 30.41 39.90 40.20 33.20 40.39 
1971 34,10 45.50 46.60 41.60 47.80 
1972 37.92 46.75 46.99 43,91 51.99 
1973 44.00 49.20 48.34 61.43 55,49 
1974 57.70 57.70 66.69 116.67 85.66 
1975 111.90 101.30 104.9 142.40 133.90 

Plant Number 
Year 6 7 8 9 10 
1966 26.78 35.54 28.30 32.44 30.01 
1967 26.32 35.00 28.50 33.54 29.88 
1968 26.62 36.00 29.20 34.29 30.15 
1969 28.41 36.00 29.40 35.96 31.96 
1970 30.78 32.90 32.90 41.53 36.52 
1971 34.11 31.70 39.70 51.30 43.70 
1972 38.06 31.85 43.60 56.94 51.50 
1973 43.37 31.68 47.71 64.10 54.80 
1974 65.49 34.38 86.90 96.98 102.39 
1975 96.60 62.00 106.00 134.03 136.83 
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Table 3 (Continued) 
Plant Number 


Year 11 12 13 14 15 

1966 25.07 26.66 38.93 20.95 17.31 
1967 25.29 26.13 38.18 20.52 17.71 
1968 25.67 26.36 39.23 20.00 18.69 
1969 26.99 29.28 41.45 20.42 20.36 
1970 30.08 30.25 46.02 21.84 22.11 
1971 40.02 31.02 54.09 22.90 26.47 
1972 43.97 39.54 56.88 24.60 29.64 
1973 46.74 49.80 65.51 25.36 34.50 
1974 72.74 68.88 69.93 28.18 72.92 


1975 124.40 123.00 135.95 45.50 128.90 
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Table 1 Monadic Functions 


Symbol Name Definition Example 
+ Conjugate +Ais A A 
7 
+A 
7 
- Negative -AisO-A A 
7 
~A 
“7 
x Signum xAis (A>0)-A<0 A 
When Ais positive a I 
results. When A is 7 
negative a 1 results. 
When Ais zero a0 x4 
results. 
1 
x(-A) 
ee! 
+ Reciprocal +Ais1:A A 
7 
+A 
0.14286 
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Symbol Name Definition Example 
| Magnitude Absolute Value B 
7.653 ~7.653 7.456 ~7.456 
|B 
7.653 7.653 7.456 7.456 
L Floor Least Integer B 
7.653 ~7.653 7.456 “7.456 
LB 
7°87 78 
[ Ceiling Greatest Integer B 
7.653 7.653 7.456 “7.456 
TB 
8°78 7 
? Roll ?A is random number ?A 
from set of (p A) 
consecutive integers 1 
with each integer 
having the (179A) 27777777 
probability of being 
selected. 6442155 
* Exponent eA A 
7 
*A 
1096.6 
@ Natural In A or log,A A 
Logarithm 
7 
@A 
1.9459 
co) Pi Times TxA o1 
3.1416 
C 
12 3 
oc 


3.1416 6.2832 9.4248 
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Table 1 Monadic Functions (Continued) 


Symbol Name Definition Example 
! Factorial !4=1~x2x...x{A-1)xA A 
7 
!A 
5040 
ry 
24 
~ Not ~1isO,~0 is 1, D 
Truth table defined for 
QO and 1 only. 10 
~D 
o10 
~~D 
101 
Table 2. Dyadic Scalar Functions 
Symbol Name Definition Example 
+ Plus Add 3 + 1.02 
4.02 
5 + 4 
9g 
- Minus Subtract 6 - 7 
“4 
x Times Multiply 5 x 4 
20 
+ Divide Divide 5 +4 
1.25 
| Residue Remainder after divide 5|25.010 
0.01 
L Minimum Smaller of two values 5b 4 
4 
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Table 2 Dyadic Scalar Functions (Continued) 


Symbol Name Definition Example 


[ Maximum Greater of two values 57 4 


7 OO 
* Power A to the B power: A? 3 * 2 
9 
Hoe 15 
2 
® General The base A logarithm of B 10@5 
Logarithm 
0.69897 
10@1005 
3.0022 
Symbol Name Definition 
re) Circular, 10X = Sine X 
Hyperbolic, 
and 20X = Cosine X 
Pythagorean 
Functions 30X = Tangent X 


HOX = (14+X*2)*.5 


SOX = SinhxX 
60X = CoshxX 
7OX = Tanh xX 


“40X = Arcsin XY 


~20X = Arccos X 
~30X = Arctan X 
THOX = C14+X*2)*.5 


“SoX = Arsinh X 


“60X = Arcosh xX 


tt 


~7OX = Artanh X 


It 


OOX = (14+ X*2)*.5 
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Table 2. Dyadic Scalar Functions (Continued) 


Symbol Name 

i Binomial 
Coefficients 

A And 

Vv Or 

*® Nand (not 
and) 

Ed Nor (not or) 


IA A 


IV tu 


HV 


Table 3 Dyadic Array Functions 


Less than 
Not greater 
than 

Equal to 
Not less 
than 
Greater than 
Not equal to 


Definition Example 
A\ _ A! The Bth term in the expansion 
ts) ~ (A — B)IB! of (X + Y)4, also the number of 
10 combinations of A things taken 
({5)) = stt0 = 252 B at a time. 


A and B must be logical (0 or 1) variables 


Result is 1 if relation holds (TRUE) and 
0 if it does not hold (FALSE). For example, 
§<7is1,5>7is 0. 


Function Symbol Scalar f Array Array f Array 
Plus + 3+48 °2 8341+ 246 
Yaa ie 58.7 
Minus = 3-48 °2 341-246 
AS Ae SO a 

Times x 3x4 8 2 341x246 
12 24 “6 6 16 6 

Divide + 3448 °2 S00 e' Oe dG 
0.75, 0.375. “445 1.5 1 0.16667 

Residue 3/4 8 2 34 1/2 4 6 
121 200 

Minimum L 3L4 8 2 3.4 112 4 6 
a any 241 

Maximum [ 3f4 8 2 34 172 4 6 
4 8 3 346 
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Table 3. Dyadic Array Functions (Continued) 


Function Symbol Scalar f Array Array f Array 

Power * 3* 4 8 2 Z34H1t* 246 
81 6561 0.1111 9 256 1 

Logarithm ® 3 @4 8 2 342@24 6 


1.2619 1.8928 0.63093} 0.63093 1 2.585 
Circle o) 30 4 8 2 3410246 


1.1578 6.7997 2.185 | 2.185 4.1231 ~0.27942 


Binomial ! 3.!482 341!246 
4u 56 0 016 

And A 1A1010 101041100 
1010 1000 

Or v 1vi010 1010v1100 
1111 1110 

Nand n 101010 101001100 
01014 0111 

Nor ¥ 1¥1010 1010¥1100 
0000 Oo01 

Less < 3<4 8 2 3 34 1<2 4 6 
1100 001 

Not greater s 384 8 2 3 3 4 152 4 6 
11tod Oo1ait 

Equal = 3=4 8 2 3 341=246 
0001 010 

Not equal # 344 8B 2 3 34142446 
1110 101 

Not less 2 3248 2 3 3412246 
00141 110 

Greater > 3>482 3 3 41> 246 
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Table 4 Mixed Functions 


Examples 
Name Form Definition Arrays used in Examples: 
Shape or oA Results in vector A B C: 
Size whose elements 
are the number of 123 10 20 100 200 
elements inA ifA 
is a vector, or the 4 5 6 30 40 300 
dimension of A if 
A is an array. 789 #50 60 
70 80 
pc 
3 
pA 
3.3 
pB 
4 2 
Ravel 5A Results in vector 5A 
whose elements 
are the elements 123456789 
of A in row order, 
Reshape ApB Reshapes the 2 2 oB 
ravel of B to shape 
specified by A. 10 20 
30 40 
24 0A 
1234 
5 6 7 
3.5 9A 
412345 
6789 1 
2345 6 
Reversal oA or Reverses elements oA 
eA in A. > reverses 
the elements 321 
along the last 
coordinate. © 65 4 
reverses the 
elements along 987 
the first 
coordinate. oA 
789 
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Examples 
Name Form Definition Arrays used in Examples: 
42 3 
oc 
300 200 100 
Rotate AB or The elements of 26C 
AB B rotated A 
positions. ¢ 300 200 100 
rotates elements 
along the last 2oA 
coordinate and © 
rotates elements 3.12 
along the first 
coordinate. 645 
9 7 8 
26B 
50 60 
70 80 
10 20 
30 40 
Trans- &A or &4 transposes QA 
pose A&B the axes of array 
A. ASB, 147 
arranges axes of B 
to conform to 25 8 
argument A. 
369 
&B 
10 30 50 70 
20 40 60 80 
1 1A 
159 
2 1 4 
147 
25 8 
3.69 
1 2 A 
12 3 
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Table 4 Mixed Functions (Continued) 


Examples 
Name Form Definition Arrays used in Examples: 
789 
Catenate A,A Joins two arrays Cy,15 
A,LIJB along last axis. 
A,LZ]B joins 100 200'°300. 4-2 24 5 
B toA on the /th 
axis. B.B B,LIIB 
10 20 10 20 10 20 
30 40 30 40 30 40 
50 60 50 60 50 60 
70 80 70 80 70 80 
10 20 
30 40 
50 60 
70 80 
Laminate A,[2]B Joins two arrays C,C.11¢ 
along a new axis, 
where J is not an 100 200 300 
integer, new axis 
is | J. 100 200 300 
C,[1.11¢ 
100 100 
200 200 
300 300 
0(A,[1.1J]A) 
3.2 3 
A,[1.1J]A 
12 3 
12 3 
45 6 
45 6 
789 
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Table 5 Mixed Functions 


Name Form Definition Example 


Take Nta If N is positive, 24C 
first N 
elements are 100 200 
taken and if 
negative, last “4 AG 
N elements are 
taken from 300 
vector A. 
2 2tA 


Drop NYA _ IfN is positive, 24C 
first N 
elements are 300 
dropped and if _ 
negative, last 14C 
N elements are 
dropped from 100 200 
vector A. 


Compress N/A _ Selects 01 0/C 
elements from 
A as deter- 200 
mined by zero 
one argument 10 0/A 
N. For each 1 
in N, the cor- 1 
responding 
element in A is 4 
selected and 
for each 0, the 7 
element is not 
selected. 10 0/[1]A 


Expand N\A Fills array 1011\C 
with alphabetic 
spaces or 100 0 200 300 
numeric zeros 
corresponding 1010 1\f1]4 
to zeros in the 
argument N. 12 3 


Oo fF 
ao 
oO 
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Table 5 Mixed Functions (Continued) 


Name Form Definition Example 
Indexing AL] — Selects ele- cli. 3] 
ments from A 
depending on 100 300 
expression en- 
closed in Al1;3] 
brackets. 
3 
A[1 2 333] 
369 
BL32] 
20 40 60 80 
{Index of A 1B Returns the 12345 6717 
index value of 
first occur- 7 
rence of BinA. 
Ov Ody Da 
3 
2-3 4? 1745 3 
5 6 2 
Index 1 A Generates first 110 
Generator A integers in 
order. 123456789 10 
Member- AeB Determines if 23 4 2 162 
ship each element 
of A is a mem- 10010 
ber of B. 
234 2 te2 3 
112010 
Grade AA Returns the Au 2345 
Up index values of 
A in ascending 42315 
order. 
ALAA] Sorts the ele- A4uo 20 31 10 55 
ments of 
vector A in u23145 
ascending 
DEAE: SCAS+4u 2315] 
12345 
Grade yA Returns the V42315 
Down index of values 
of A in Sd. Be 2 


descending 
order. 


Y4o 20 30 30 10 55 
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Name Form Definition Example 
ALYA1] Sorts the ele- 613425 
ments of 
vector A in SC¥S<40 20 30 30 10 55] 
descending 
order. 55 40 30 30 20 10 
Deal A?B Selects A 2°20 
random 
integers with- 185 
out replace- 
ment from iB , 10?10 
each integer 
has a(13pB) 657821012943 
chance of 
selection. 10710 
10943685217 
Matrix EA Produces the RND©2 2 pid 
Inverse inverse of a 
nonsingular RND 
matrix. 
13 
ui 2 
EIRND 
2115.9 
1-055 
Domino BBA Domino can be 5X + X +3X, = 7 
used to solve a 1 2 3 
set of linear 10X +3X +5X, = 10 
equations if A 7 1 . 
has the same 2X +0. 24,,4+8.34=0 
number of 1 
rows as col- A 
umns. Domino 
returns the Bes 
coefficient of 
least-squares 10 3:5 
regression if - 
the number of 20.2 8.3 
rows of A 
exceeds the B<«+7 100 
number of 
columns. BEA 


1.7881 3.4851 0.52485 
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Table 5 Mixed Functions (Continued) 


Name Form Definition Example 
AA 
5 13 
10 3 5 
Domino §) 2 .2 8.3 
273.5 0.5 
B 
fare @ ek Oat | 
BAA forces intercept 
through zero 
0.90461 0.20213 0.22486 
BEl1, AA catenate column 
of ones for intercept 
6.2918 0.26813 1.5607 0.7310 
Quad [kA Displays A and (k' CHARACTER! 
output generates a line 
feed. CHARACTER 
(SUMS++/B 
18 
Quote [A Displays A In a Function 
quad with no line tA ' 
output feed. Ue 1, wa" 
("k' CHARACTER! 
At Cee CHARACTER 
Quad A+{]_— Enters a line of A+{] 
Input input from a 
function. U1: 
42-3 
A 
12 3 
Quote A<( ‘Reads a line of A" 
quad characters and 2 
input creates a char- 
acter vector 
from inside a ACB 
function. 
A 
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Table 5 Mixed Functions (Continued) 


Name Form Definition Example 
Format vA Monadic format ('THE LOG OF',¥A),'EQUALS' ,7@eA 
monadic A results ina 
character rep- THE LOG OF 2 BQUALS 0.69315 


resentation of 
A so it can be 
catenated & 
displayed on 
same line with 
character data. 


Format A¥B Displays B A 
dyadic according to 
specification 5 1 3 
A. The first 
element of A is 10 «(3 5 
the number of 
columns and 2 0.2 8.3 
the second ele- _ 
ment is the 2 355 0.5 
print precision. 
5.000 1.000 3.000 
10.000 3.000 5.000 
“2.000 200 8.300 
250007. "3,500 . 2500 
7172 7 3A 
5.0 1.00 3.000 
10.0 3.00 5.000 
“2.0 -20 8.300 


2.0 3.50 .500 
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Answers to Exercises 


In this section a few answers to selected problems are presented. A 
complete set of solutions to both the APL exercises and statistical prob- 
lems is available from the publisher. 


y) (f} Domain error 
(k) 0 1 1.58 2 
(p) Domain error 
Statistical Applications (r) 2.5 
1. 15.00 (u) 100 
2. 13.33 yh le Be 
3. Fl 
a zteles “44-18 0 6 10 12 12 
4. 0.2457E-18 F3 
5, (a) 254.97166 552 56 0 2 0 6 56 
(b) 14.99 4. (a) 2 
(c) 5.02 (b) 7 15 
(d) 0.057656 (e) 1.83 
6. 36,383.88 5. (a) 2.928 
(d) 3085 
(g) “5 
3 (j) 110 
7. (a) 45 
APL Practice oo 
(h) 6.2E20 
1. (a) blank (1) 82.999 82.5 
(c) 0 
(e) 3 Statistical Applications 
(h) 3 1. a. 380.4 
2. (a) 1 1 1 1 c. 1.0905 
(d)1 4 9 16 e. 0.08515 
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Answers to Exercises 


2. a. 4.2 
4. a. 7.3% 
b. 0.8 
6. a. $.8 
b. 2.88 and 5.99 
7. $194.32 


4 


APL Practice 


lL (a O00000000000 
(e)67890123456 
(i) ~3. -2 
(i) 007271 
(0) Index Error 
2. (a) 3 random numbers from 1 to 6 with 
replacement. 
4, ((7p0),4p1)/Z 
We use the first 20 terms of each series. 
(a) 0.69 
(e) does not converge 
(f) 0.58198 
(i) =~.3 
(k) does not converge 
7, (a) Cauchy-Schwartz 
+ /X xX Ys (4+ /X#2) x (+ /Y«2) 
8. (e) type > 
(f) you will have an infinite loop. 


Statistical Applications 


2. The maximum value of S is 5 when P = .5 
and the minimum value of S is 0 when 
P= 1, 


3. Mean equals 5.920 
Variance equals 1.920 
Standard deviation equals 1.360 


) 


c. 0.447 

d. (1) 19.61, 20.51 
(2) 19.16, 20.95 
(3) 18.72, 21.40 


d. 2 

e. 0.6 

f. zero 

i. Ss = ~0.6 
S4 = 2.2 


APL Practice 


1. 
2. 


(0) (X[4X)[3] 


(a) -2 “1 “1001122 
22 1101122 
010101010 


(c) Same as Z 
3.246 3.246 ~—549 


(a) X + 2X + 3X + 4X 

(c) none 

(d) I 

The root is between 0.3169 and 0.3170 


Statistical Applications 


4, 


a. Mean = 2.6 
Variance = 3.3714 
Standard Deviation = 1.8361 
Mean Deviation = 1.64 
Median = 3 

b. Mean = 3 
Variance = 6.2727 
Standard Deviation = 2.5045 
Mean Deviation = 1.9048 


Median = 3 
a. weighted average = 2.044 
a. 12.5 
b. 159.17 
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APL Practice 


2. 


0.866 

0.500 

0.577 

1.732 

(b) V CHECK [10] 
[1] S(L.5++4+/+(0,150)!50) = 2V 
CHECK 
YES 


Problem 


F<M 
F2M 
F>M 


19.125 


(a) 84 
(c) 2 
166.67 


Answer 
(~F)AM 
Fy~M 
FA~M 


Statistical Applications 


1. 


2. 


4, 


a. N and M 


e. | MNTS X 
0 
2 MNTS X 
2.66 
3 MNTS X 
0 
4 MNTS X 
15.46 


a. The sixth element of the series 10 bi .3 
is 0.1029. 
d. 1—+/Spl10B/ .3 or 0.16 


for P = .5 and K = 4 the probability 
F(N) = 0.0625 


12. 


14, 


15. 


17. 


7 


2 POISSON 1 
0.1839 


Mean = 5 

Variance = 2.5 

Third moment = 0 

Fourth moment = 17.5 

Like the normal this distribution is bell 
shaped but less peaked than the normal. 


Mean = 6 

Variance = 2.4 

Third moment = ~0.48 

Fourth moment = 16.224 

The distribution is skewed to the left 
(since V, < 0). 

a. 6 = 1/1008C 

c. 0.055 

h. 0.94 

a. 0.6 

d. 1.39 

e. 6.11 


APL Practice 


1. 


A (a) 123456789 10 

(f) ABC DA BC D8 
B (d) A BC DEF 

(e) A B C DE FE F 
(d) MEAN 

These data are for part D of Exercise 
No. 2 
CI: 

(7@8,3) x 1100 
54, 921 


)WSID 
CLEAR WS 


(b) (+/DATA) + pDATA 
(d) 83325 


[3.5] > EXIT xt 1=//‘FINISHED’ = 
8pDISP 
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Statistical Applications 9 
2. a. ~3.8696E717 
b. ~0.84543 
3. a. 0.06 0.22 0.31 0.26 0.1 0.03 0.02 DENCE TACURe 
b. .59 .41 I. (c) U<--Y-A-BxX 
4. TSX —NORM N (e) SU<S+(pX)-2 
[1] X — ? (N,30)p10 (k) VYXO<—XU x((XO —XB)*2)+ 
[2] SX — (((+/X)+30)—5.5)+ XSQ)++N 
(8.25+30)*.5 V 5. F(20) = !19 
F(3.5) = !2.5 
Statistical Applications 
8 1. a. R? = 0.043 
d. R* = 0.044 
APL Practice f. R? = 0.036 
1. (a) 7 3. Since F = .507 for the column means 
12 we are unable to establish a dependency 
(d) 1 YOU on dusting methods. 
(e) WTACWTACWT 4. a. F for column means = 10.4, F(3,12) = 
WWWWWWWWWwrAc 3.49 so we reject hypothesis of no 
2. They must be scalars difference. 
. 7. Row means F = 2.3401 
8: ne i eas Vue “Y-M Column means F = 0.045685 
me (4) oc riGe MY not sir Me) Row & Col. means F = 1.4223 
4. [3]->(N = PR)/2 a) Ho not rejected 


b) Ho not rejected 
Wel K c) Ho not rejected. 
11. (c) (12.2|[(OLX TTEST Y; N1; SS; X1; X2; 


NT; C;G;A;N2;NO 


Statistical Applications 1 0 
2. The result of A TTEST B is ~1.15 and 
the ¢-value from the f-table for 90% 
confidence is “1.812. Thus the program APL Practice 
was not effective. 


2. (a) 2 -5 

3. 0.15 -1 3 
4. a. Computed poisson frequencies are (b) syntax error 

223.13 334.7 251.02 125.51 (c) 2 71 

47.068 14.12 3.53 0.75643 -5 3 

0.14183 0.023683 (k) 123 78 


§. a. $32.50 44 28 
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10. 


12. 


Answers to Exercises 


Left inverse is given by # Z 
((EZ+ .x Z)=(QZ)+. XB) Z 
results in 

1 1 

1 1 


2.3 1 

“3.7 1 
4.7 1 

The rounding of the constants changed the 
solutions dramatically. 


a to g result in a matrix of ones proving 
that the statements are true. 


A=A+.XA and A=(8A)+.xXA result in 
matrices of ones. 


36.5 and 55.7 


Statistical Applications 


1. 
2. 


a. G4, = 8, d. = $,var a, = 4, var a, =$ 
x = 909.91, y = 561.12, 

@ = ~39.532, b = 0.66012 

t = 69.824, R? = 0.99449 

a. x = 909.91, y = 46.934, 


G = ~38.313, 6 = 0.093688 
t = 32.659, R? = 0.97531 
b. ¥ = 8.9271ES5, y = 46.934, 


= 2.4623, 6 = 4.9817E-5 
t = 45.556, R? = 0.98716 


> 


11 


APL Practice 


4. 


5. 


(b)1 1 1 0 I\[2]X 
(c) 1 00 14 X 
(d) 1, “1 44 7K 
C—((0, 09) 0.!0, 29) x 10 
then 


(C+.xC)x(210)0.=(410) 
gives a matrix of ones. 


10 pl ~1 


10. 


A<2 2pl 0 0 2 
B<2 293 0 0 1 
(A XB)=BxA 

1 1 

| I 
1. +/[2]+/]3JALL 
2. [ /f2]+/BJALL 
3. +/+/+/ALL 


Statistical Applications 


1. 


Macro [;4] Regress Macro [;10] 
xX = 5.3714, y = 135.76, Gd = 44.673, 
b = 16.957, 

t = 9,837, R? = 0.78185. 

Macro [;4] Regress Macro [;10] 
after changing lines [4] and [5] to 
[4] SSE<—225 x (n—2) 

[5] V<—225 

x = 5.3714, y = 135.76, 

G@ = 44,673, 6 = 1.8179, 

t = 0.5126, R? = 0.86654 


12 


APL Practice 


1. 
5. 


10. 


13. 


The required number is the mean of L 


QO is alist of N random numbers from 1 to 
K 

(a) (2100) o.*110 

(c) (4100) o.*.1x 210 

(f) $30 30p(.2*0,229),.2 for p=.2 

(a) 11.038 

(b) 11.338, 11.15, 11.038, 11.225 

(b) a row of ones 

(c) you obtain c 


(g) a matrix whose rows is either ones or 
not c. 


Answers to Exercises 
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Statistical Applications 


1. 


10. 


a. Beta coefficients are 2.6706 and 
~129.42 
their t ratios are 32.209 and ~1.7648 


Coefficients and corresponding t-ratios 


a* ay As 
1.0223 0.42378 0.49806 
(0.027089) (18.425) (2.3033) 
Coefficients and corresponding t-ratios 
ao a, ay 
2730.8 — 1.5229 50.688 
(2.9563) (—2.1977) (6.3059) 


b. y = .9, b, = ~.14, b, = .37, 
by = 57.67 

t ratio for C; = ~1.9948 

t-statistics 

—2.3658, 51.987, 3.9462, 

— 4.5486, 3.3161, 0.34169 

F = 25.888 


a. F statistic = 5.1094 with 30 and 120 
degrees of freedom. 

b. F statistic = 2.3604 with 30 and 120 
degrees of freedom. 


ay = 7.7956 and a, = ~0.32873 
Elasticity = —.32873 
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3. 


11. 


12. 


t-statistic for ag = 2.35 thus the H, is 
rejected. 
F statistic = 197.7 and H, is also rejected 


a. equilibrium P = | & QO = 3. 


b. TTest | 
—0.23881, 0.79786, 0.79786 


Beta coefficients = ~569.26, 2.6626, 
19.659 
t-ratios = ~6.4608, 204.67, 7.0156 


C. do = 2.7769, a, = 0.80121 
ay = ~10.298, a, = 1.1005 


coefficients t values 
I-y = 0.55511 5.9576 
Yp = 0.30999 5.2282 
Yq = 722.376 ~3.316 


Using Y 1978 aad 1,385.1 
and Cy977 = 857.7 
we obtain 
Cig7g = ~22.376 + .31(1,385.1) 
+ .56(857.7) = 883.11 
which is close to the actual value of 891.2. 
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Absolute value 35-36, 49, 62, 67 
Account no. 294 

Addition 7, 12 

Addressable cursor 25 
Administrative procedures 1 
Aitken's generalized least squares 275 
And 79 

APL program for linear regression 145 
APL keyboard 2, 298-301 

Arithmetic functions 6-10, 12 
Arithmetic mean 20, 28 

Arrays 10-12, 172 

Arrays: parity 36 

Assignment 17, 28 


B 

Barone, J. 303 
Barron, J. 76 
BASIC 7 


Basic statistics 58 

Beaton, R. D. 303 

Benchmarks 302 

Bibliography 330 

Binomial coefficients 73 

Binomial distribution 75 

Binomial probability 74 

Blanks 10, 12, 28 

Branching 45, 50 

Breaking up the sum of squares 145 


Cc 

Case study in program develop- 
ment 127 

Catenate 40, 49, 210 

Cathod ray tube 25 

Ceiling 62, 67 

Character arrays 78, 98 

Character input 103 

Character set 304 

Checking parentheses 24 

Checking results 132 

Chi-square 148, 153 

Chi-square distribution 149 

Clear WS 2 

Column vector 172 

Combination of R things N atatime 74 

Command key 7 

Comment 163 

Compression 79, 81, 197 

Computers Ltd. 301 

Concave functions 92 

Conditional branches 45, 46, 50 

Contingency tables 149 

Contingency table routine 153 

Continuation 27, 29, 101 

Convex functions 92 

Correcting a defined function 47 

Correcting a function line 60 

Correcting typing 25, 42 

Correlation matrices 240 

Covariance 73, 240 
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Index 


Covariance and correlation mat- 

rices 240 
Cross products 72 
Cumulative Chi-square 161 
Cumulative F distribution routine 162 
Cumulative Poisson distribution 76 
Cursor 25 


D 

Data and information 97 

Data set ‘Macro’ 310 

Deal 43 

Debugging 127 

Del 41, 50 

Deleting a line ina function 119 

Determinant 231 

Determinant routine 234 

Diagnostics 12] 

Digital Equipment Corporation 294 

Display 48, 51 

Display numeric and character data on 
one line 99 

Display one or several lines of a 
function 117 

Distributions 73 

Divide 9, 12 

Domain error 9, 98 

Domino ‘180 

Drop function 214 

Dummy variables 63, 67 

Durbin-Watson statistic 261 

Durbin’s estimator in first order autore- 
gressive models 278 

Dyadic compression 82, 197 

Dyadic expand 199 

Dyadic functions 16, 28 

Dyadic reshape 78 


E 

e 22 

Elementary linear regression 144 

Elementary matrix operations 174 

Elementary statistics 15 

Entering data inside a function 101 

Entering data on two lines 27, 29 

Entering function definition mode 41, 
50 

Equal 79 

Estimators of regression coefficient 144 

Estimators of the variance of regression 
estimators 144 

Execute 6-8, 18, 27 

Execution mode 2 

Exit from function 47 

Expand 193, 199 

Exponential function 22, 28 


F 

Factorial function 74 

Features of APL 2 

First order autoregressive model 279 
Floor 62, 67 

Format 100 


F-Statistic 154 

Functions and arrays 11, 12 
Function definition 41, 50 
Function display 117 
Function glossary 316 
Function headers 43, 50 


G 

Gamma function 160 
Generalized least squares 275 
Generating random numbers 43, 49 
Getting started 6 

Geometric distribution 93 
Geometeric mean 21, 28 
Global variables 63, 67, 135 
Goodness of fit 148 

Grade down 38-39, 49 

Grade up 38, 49 

Greater than 79 

Greater than or equal to 79 


H 

Hamburg, M. 128 

Harmonic mean 23, 28 

Heteroskedastic models 259 

Hewlett-Packard 294, 299 

Higher and cross product moments and 
distributions 72 

Higher-order arrays 191 

Higher order sample moments 72 

Histograms 76 

How mistakes begin 129 

How to calculate a mean 19, 28 

How to write your own function 34 


I 

IBM 1, 6, 294, 307, 234-235, 298, 307 
Identity matrix 179 

Immediate execution mode 2 

Index error 38 

Index generator 46, 49 

Indexing arrays 37, 49, 193 

Inner product 27, 29, 221 

Inner product generalized 222 

Input continuation 101 

Insert a new line 118 

Interactive programming 44 
Introduction to linear regression 144 
Instrumental variables 272 

Invalid variable names 18, 28 
Inverse 179 

Inverse of a nonsquare matrix 182 


J 
Johnston, J. 268, 279 


K 

Kaplan, W. 16] 

K-Class estimators 282 

Keyboard 2, 295, 298-301 

Keying conventions 6 

Kmenta, J. 26, 268, 269, 276, 285, 288 


Index 


Kronecker product 229 
Kurtosis 94 


L 

Label 45 

Lagged variables 259 

Lagrange multiplier 290 

Laminate 212 

Left inverse of a nonsquare matrix 18] 

Length error 12, 98 

Less than 79 

Less than or equal to 79 

Limited information maximum likelihood 
estimators 288 

Linear regression 183, 240 

Line continuation 27, 29 

Line labels 45, 50 

Local variables 65, 67 

Logarithm function 22, 28 

Logic of residue function 36 

Logical functions 79 

Log-Off procedure 297 

Log-On procedure 1, 295 

Longley benchmark 302 

Longley, J. W. 302 

Looping 232 

Lost results 18 


M 

Macro, data set 310 

Making a program interactive 103 

Matrices 172 

Matrix addition 175 

Matrix algebra 172 

Matrix inverse 179 

Matrix multiplication 176 

Matrix subtraction 175 

Matrix transpose 178 

Maximum 77 

Maximum likelihood estimator 288 

MCM System 900 201 

Mean 19, 28 

Mean and variance of probabilities 27, 
29 

Median routine 58 

Minimum 77 

Minus 8, 12 

Mistakes, how they begin 129 

Modem 296 

Moment 240 

Monadic format 100 

Monadic function headers 44, 50 

Monadic functions 16, 28 

Multiline data input 101 

Multiply 9, 12 


N 

Negative numbers 8, 12 
New line, insert 118 
Niladic function 41, 50 
Non-singular matrices 179 
Normal distribution 83 
Not and 79 

Not equal to 79 


‘Not or 79 
Numeric arrays 97 
Numeric input 102 


0) 

One-way analysis of variance 154 
One-way anova 154 

OR 79 

Ordering arrays 37, 49 

Outer product 131, 152, 221, 223 
Overstruck characters 306 
Overview of APL 1 


P 

Parentheses 23 

Partial correlation coefficients 246 

Password 294 

Pearson, E. S. 87 

Pendent functions 126 

PI times 84 

Poisson distribution 75 

Power 22 

Private library 109 

Problems from careless use of global 
variable 135 

Procedures, administrative 1 

Production functions 225 

Program development 127 

Program errors 127 


Q 

Quad 27, 29 
Quad-divide 180 
Quad-input 102 
Quote 44, 50 
Quote-quad input 103 


R 

Random numbers 43, 49 

Rank error 46 

Ravel 57, 66, 173, 209 

Reciprocal 17, 28 

Recursive programming 76 

Reduction 19, 28, 191 

Regression routine 146, 254 

Regression with restricted 
coefficients 277 

Removing a suspended function 48, 51 

Reshape 173, 151 

Residual routine 148 

Residue 34-35, 49 

Restricted coefficients 277 

Return 6-10, 27 

Reverse or rotate function 200 

Rho 20, 28 

Right inverse of a nonsquare matrix 182 

Right to left order 15, 28 

Roll 42, 49 

Rotate 200 

Row vector 172 


Ss 
Sample median 34, 49 
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Sample results of regression rou- 
tine 147 

Sample standard deviation 24, 29 

Sample variance 23, 29 

Saving workspace 106 

Saving workspaces on microcom- 
puter 111 

Saving your workspace 307 

Scan 21, 28 

Shape 20, 28 

SI Damage 48, 51, 104, 126 

Simple and partial correlation 
coefficients 245 

Simple least squares regression 144 

Simpson’s rule 91 

Simultaneous equation models 268 

Singular matrices 179 

Smillie, K. W. 76 

Sorting an array 39 

Standard deviation 24, 29 

‘‘Standard’’—normal density 
function 84 

State diagram 2-3 

Statements automatically renum- 
bered 119 

Standard deviation routine 58 

Stop 125 

Student’s t 128 

Student’s t test routine 135 

Subtraction 8, 12 

Sum of squares 145 

Suspended functions 126 

Syntax error 8, 12, 17 

System commands 18, 28 

System of linear equations 183 


T 

Take function 213 
Terminating input request 103 
Trace 123, 136 

Transformed variables 258 


’ Trapezoidal rule 91 


Transpose 205 

Truth table 79 

Two-way analysis of variance 156 
Two-way anova 156 

Two-way classification 150 
Two-stage least squares 269 


U 
Unconditional branch 45, 50 
Using APL in the calculator mode 132 


Vv 

Valid variable names [8, 28 
Value error 64, 67 

Variables and assignment 17, 28 
Variable names 18, 28 

Variance 23, 29, 58 

Vectors 172 

Vectors matrices, and arrays 172 


Ww 
Work space 2 
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Symbol Index 


, 12 

12 

12 

, 12, 28 

18-19, 28 

19, 28; 79; 191 

20, 28; 78; 151; 173 
20, 28; 193 

22, 28 

22, 28 

27, 29; 121 

27, 29; 101 
34-36, 49; 62, 67 
38, 49 

38-39, 49 

40, 49; 57, 66; 173; 209; 
212 

41, 50 

42, 51, 109 


WON ONN 


48, 51, 60, 109 
48, 51; 104; 126 
62, 67; 77 


7 
e) 

e 

«Q 

cae 

) CONTINUE 
)WSID 

) SAVE 

)LOAD 

)LIB 

)SAVE CONTINUE 
) COPY 

)PCOPY 

) DROP 

)FNS 

0 
TA 
SA 
)SIV 


° 
. 


° 


x 


+7 8 0 


62, 67; 77 
84 
99 
100 
102 
103 
106 
107 
107 
107 
108 
108 
108 
108 
109 
110 
117 
123 
125 
127 
131 
152 
180 
200 
205 
213 
214 


